Scribd Logo

Scribd

Principal Data Engineer / Architect

Job Posted 5 Days Ago Reposted 5 Days Ago
Remote
3 Locations
158K-260K Annually
Senior level
Remote
3 Locations
158K-260K Annually
Senior level
Lead the design and development of data architecture, focusing on data modeling, integration, and analytics for Scribd's data strategy.
The summary above was generated by AI

At Scribd (pronounced “scribbed”), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare. 


We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.


When it comes to workplace structure, we believe in balancing individual flexibility and community connections.  It’s through our flexible work benefit, Scribd Flex, that employees – in partnership with their manager – can choose the daily work-style that best suits their individual needs. A key tenet of Scribd Flex is our prioritization of intentional in-person moments to build collaboration, culture, and connection. For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.

  

So what are we looking for in new team members? Well, we hire for “GRIT”. The textbook definition of GRIT is demonstrating the intersection of passion and perseverance towards long term goals. At Scribd, we are inspired by the potential that this can unlock, and ask each of our employees to pursue a GRIT-ty approach to their work. In a tactical sense, GRIT is also a handy acronym that outlines the standards we hold ourselves and each other to.  Here’s what that means for you: we’re looking for someone who showcases the ability to set and achieve Goals, achieve Results within their job responsibilities, contribute Innovative ideas and solutions, and positively influence the broader Team through collaboration and attitude.


What You'll Do:

As a pivotal member of the team, you will lead the design and development of a robust data architecture that guides data modeling, integration, processing, and delivery standards enabling modern data product development at Scribd.


You will also serve as a data and analytics solution architect, leading architecture initiatives encompassing data warehousing, data pipeline development, data integrations, and data modeling. You will shape Scribd’s data strategy, guiding stakeholders in how they consume and act on data.


We’re looking for someone with proven proficiency in architecting, designing and development experience with batch and real time streaming infrastructure and workloads. Your expertise will help establish standards for data modeling, integration, processing, and delivery and also help translate business requirements into technical specifications.


At Scribd, we leverage deep data insights to inform every aspect of our business, from product development, experimentation, to understanding our subscriber engagement and tracking key performance indicators. You'll join a data engineering team tackling complex challenges within a rich domain encompassing three distinct brands – Scribd, Everand, and Slideshare – all serving a massive user base with over 200 million monthly visitors and 2 million paying subscribers. You'll have the opportunity to make a real impact as we are heavily investing in improving our core data layer and this exciting new role puts you right at the forefront of this initiative.


Based on the project, this might involve cross-functional work with the Data Science, Analytics, and other Engineering and Business teams to design cohesive data models, database schemas and data storage solutions, consumption strategies and patterns. Almost everything you will be working on will be to increase the "customer satisfaction" for internal customers of Scribd data.


Required Skills:

• 10+ years of experience in data engineering, with a strong background in data architecture, data modeling, and data management, building and scaling robust data systems for complex business domains.

• Expertise in Scala or Python, with a deep understanding and hands-on experience in Spark for designing, optimizing, and scaling large-scale data processing pipelines, and proficiency in at least one SQL dialect.

• Experience with data lake technologies (e.g., Databricks, Delta Lake), data storage formats (Parquet, Avro), query engines (such as Photon, Spark SQL), and both real-time streaming and batch processing, or equivalent technologies and frameworks.


Desired Skills:

• Experience and working knowledge of streaming platforms, typically based around Kafka.

• Strong grasp of AWS data platform services and their strengths/weaknesses.

• Hands on experience in implementing data pipelines for data ingestion and transformation to support analytics and ML pipelines

• Strong experience communicating asynchronously using collaboration tools like Jira, Slack, etc.

• Experience using automation and CI/CD tooling like Git, GitHub,Docker,Jenkins, Terraform, etc.

• Experience developing standards for database design and implementation of various strategic data architecture initiatives around data quality, data management policies/standards, data governance, privacy and metadata management

• Working experience integrating with BI frameworks like Qlik, ThoughtSpot, Looker, Tableau, etc.

At Scribd, your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States.


In the state of California, the reasonably expected salary range is between $191,500 [minimum salary in our lowest geographic market within California] to $259,500 [maximum salary in our highest geographic market within California]. 


In the United States, outside of California, the reasonably expected salary range is between $158,000 [minimum salary in our lowest US geographic market outside of California] to $247,000 [maximum salary in our highest US geographic market outside of California]. 


In Canada, the reasonably expected salary range is between $198,500 CAD[minimum salary in our lowest geographic market] to $246,000 CAD[maximum salary in our highest geographic market]. 


We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership, and a comprehensive and generous benefits package.


Benefits, Perks, and Wellbeing at Scribd

*Benefits/perks listed may vary depending on the nature of your employment with Scribd and the geographical location where you work.

• Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees

• 12 weeks paid parental leave

• Short-term/long-term disability plans

• 401k/RSP matching

• Onboarding stipend for home office peripherals + accessories

• Tuition Reimbursement

• Learning & Development programs

• Quarterly stipend for Wellness, Connectivity & Comfort

• Mental Health support & resources

• Free subscription to Scribd + gift memberships for friends & family

• Referral Bonuses

• Book Benefit

• Sabbaticals

• Company wide events

• Team engagement budgets

• Vacation & Personal Days

• Paid Holidays (+ winter break)

• Flexible Sick Time

• Volunteer Day

Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.


Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life


---------------------------------------------------------------------------------------------------------------------------

We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing accommodations [@] scribd.com about the need for adjustments at any point in the interview process.


Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

---------------------------------------------------------------------------------------------------------------------------


Remote employees must have their primary residence in:  Arizona, California, Colorado, Connecticut, DC, Florida, Georgia, Hawaii, Maryland, Massachusetts, Michigan, Missouri, New Jersey, New York, Ohio, Oregon, Tennessee, Texas, Utah, Washington, Ontario (Canada), British Columbia (Canada), or Mexico

 #LI-Remote

Top Skills

Avro
AWS
Databricks
Delta Lake
Docker
Git
Git
Jenkins
Kafka
Looker
Parquet
Python
Qlik
Scala
Spark
SQL
Tableau
Terraform
Thoughtspot

Similar Jobs

5 Days Ago
Remote
USA
187K-298K Annually
Senior level
187K-298K Annually
Senior level
Other • Real Estate • PropTech
The Principal SDE will architect scalable data models, develop data solutions, and lead initiatives while collaborating with teams across Zillow for data utilization optimization.
Top Skills: AirflowAWSData ModelingDatabricksSparkSQL
4 Hours Ago
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Fintech • Healthtech • Software
As a Data Scientist III, you will design and implement scalable analytics solutions, lead initiatives, mentor junior members, and provide data insights to enhance client relations and business outcomes.
Top Skills: DbtLookerPower BIPythonSQLTableau
6 Hours Ago
Remote
San Francisco, CA, USA
136K-218K Annually
Senior level
136K-218K Annually
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
As a Senior Data Engineer, you'll design data models, build scalable data pipelines, and improve data quality, collaborating across teams.
Top Skills: AirflowSparkAthenaAws Data ServicesEmrFlinkHiveJavaKafkaPythonRedshiftSparkSQL

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account