Scribd Logo

Scribd

Software Engineer II (Python and Data pipelines)

Job Posted 9 Days Ago Posted 9 Days Ago
Remote
3 Locations
104K-196K Annually
Mid level
Remote
3 Locations
104K-196K Annually
Mid level
As a Software Engineer II, you'll design and develop data pipelines for metadata processing, collaborate with cross-functional teams, and optimize large-scale systems.
The summary above was generated by AI

At Scribd (pronounced “scribbed”), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare. 


We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.


When it comes to workplace structure, we believe in balancing individual flexibility and community connections.  It’s through our flexible work benefit, Scribd Flex, that employees – in partnership with their manager – can choose the daily work-style that best suits their individual needs. A key tenet of Scribd Flex is our prioritization of intentional in-person moments to build collaboration, culture, and connection. For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.

  

So what are we looking for in new team members? Well, we hire for “GRIT”. The textbook definition of GRIT is demonstrating the intersection of passion and perseverance towards long term goals. At Scribd, we are inspired by the potential that this can unlock, and ask each of our employees to pursue a GRIT-ty approach to their work. In a tactical sense, GRIT is also a handy acronym that outlines the standards we hold ourselves and each other to.  Here’s what that means for you: we’re looking for someone who showcases the ability to set and achieve Goals, achieve Results within their job responsibilities, contribute Innovative ideas and solutions, and positively influence the broader Team through collaboration and attitude.


About the team:

The ML Data Engineering team is at the heart of metadata extraction and enrichment for all of our brands, managing and processing hundreds of millions of documents, billions of images, and serving millions of users. We operate at an unparalleled scale, handling diverse datasets, including UGC documents, ebooks, audiobooks, and more. Our goal is to build robust systems that drive content discovery, trust, and structured metadata across our platforms.


Role Overview:

We are seeking a Software Engineer II with a strong background in data engineering, software development, and scalable systems. As part of the ML Data Engineering team, you will work on designing, building, and optimizing systems that extract, enrich, and process metadata at scale. You’ll collaborate closely with machine learning teams, product managers, and other engineers to ensure the smooth integration and processing of vast amounts of structured metadata.


Tech Stack:

Our team uses various technologies. The following are the ones that we use on a regular basis: Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, Sagemaker, Cloudwatch, Datadog) and Terraform.

Responsibilities

  • Design and develop data pipelines to extract, enrich, and process metadata from millions of documents, images, and other content types.
  • Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
  • Build and maintain systems that operate at a massive scale, handling hundreds of millions of documents and billions of images.
  • Optimize and refactor existing systems for performance, scalability, and reliability.
  • Ensure data accuracy, integrity, and quality through automated validation and monitoring.
  • Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
  • Manage and maintain data pipelines, security and infrastructure.

Requirements

  • 4+ years of experience in backend software engineering, with hands-on work in developing data pipelines and building and deploying your own infrastructure
  • Proficient in one or more programming languages, such as Python, Ruby or similar
  • Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
  • Hands-on experience with building, deploying, and optimizing solutions using ECS, EKS or AWS Lambdas
  • Experience with queueing and streaming technologies like SQS, Sidekiq, Kafka or Kinesis
  • Experience working with systems at scale such as External APIs, and data transformations
  • Proven ability to test and optimize systems for performance and scalability.
  • Bachelor’s in CS or equivalent professional experienceBonus points if you have experience working with Machine Learning systems

At Scribd, your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States. In the state of California, the reasonably expected salary range is between $126,000 [minimum salary in our lowest geographic market within California] to $196,000 [maximum salary in our highest geographic market within California]. 


In the United States, outside of California, the reasonably expected salary range is between $103,500 [minimum salary in our lowest US geographic market outside of California] to $186,500 [maximum salary in our highest US geographic market outside of California]. 


In Canada, the reasonably expected salary range is between $131,500 CAD[minimum salary in our lowest geographic market] to $174,500 CAD[maximum salary in our highest geographic market]. 


We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership, and a comprehensive and generous benefits package.


Benefits, Perks, and Wellbeing at Scribd

*Benefits/perks listed may vary depending on the nature of your employment with Scribd and the geographical location where you work.

• Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees

• 12 weeks paid parental leave

• Short-term/long-term disability plans

• 401k/RSP matching

• Onboarding stipend for home office peripherals + accessories

• Tuition Reimbursement

• Learning & Development programs

• Quarterly stipend for Wellness, Connectivity & Comfort

• Mental Health support & resources

• Free subscription to Scribd + gift memberships for friends & family

• Referral Bonuses

• Book Benefit

• Sabbaticals

• Company wide events

• Team engagement budgets

• Vacation & Personal Days

• Paid Holidays (+ winter break)

• Flexible Sick Time

• Volunteer Day

Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.


Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life


---------------------------------------------------------------------------------------------------------------------------

We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing accommodations [@] scribd.com about the need for adjustments at any point in the interview process.


Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

---------------------------------------------------------------------------------------------------------------------------


Remote employees must have their primary residence in:  Arizona, California, Colorado, Connecticut, DC, Florida, Georgia, Hawaii, Maryland, Massachusetts, Michigan, Missouri, New Jersey, New York, Ohio, Oregon, Tennessee, Texas, Utah, Washington, Ontario (Canada), British Columbia (Canada), or Mexico

 #LI-Remote

Top Skills

Airflow
Aws (Lambda
Cloudwatch
Databricks
Datadog)
Ecs
Elasticache
Http Apis
Python
Ruby On Rails
Sagemaker
Scala
Spark
Sqs
Terraform

Similar Jobs

3 Hours Ago
Easy Apply
Remote
3 Locations
Easy Apply
Senior level
Senior level
AdTech • Big Data • Machine Learning • Marketing Tech • Mobile • Software
As a backend engineer, you will manage Liftoff's data infrastructure, improving tools and processes for higher efficiency and performance. Responsibilities include building core engines and collaborating with the product team on strategy.
Top Skills: Data Processing PipelinesDelta LakeDistributed SystemsHive MetastoreIcebergRedisSparkTrino
3 Hours Ago
Remote
Hybrid
Pleasanton, CA, USA
133K-167K Annually
Senior level
133K-167K Annually
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Lead performance testing strategies, develop tools, analyze results, collaborate across teams, and document processes to enhance software reliability and scalability.
Top Skills: Automated ScriptingPerformance TestingSoftware Testing Tools
3 Hours Ago
Remote
Hybrid
7 Locations
155K-270K Annually
Senior level
155K-270K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves designing and developing cloud-native microservices for a Next-Gen SIEM platform, leading complex projects, mentoring junior engineers, and ensuring software engineering best practices.
Top Skills: C#DockerGoGrafanaJavaKafkaKubernetesOpensearchPostgresPythonRedis

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account