Reddit

Senior Data Engineer, ML Platform

Reposted 6 Days Ago
Remote or Hybrid
Hiring Remotely in United States
191K-267K Annually
Senior level
Lead development of data pipelines and workflows for ML models, design scalable data processing environments, and ensure high data quality.
The summary above was generated by AI
Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 116 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.

Who We Are:
The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure powering recommendations, content discovery, and user and content quantification, while directly impacting other teams such as Growth, Ads, Feeds, and Core Machine Learning.

What You’ll Do:
As a Senior Data Engineer, you will lead development of data pipelines and workflows for large-scale ML models at Reddit.

  • Design and implement scalable and secure data processing pipelines and storage environments that prepare our source of truth datasets for our models.
  • Ensure data is cleansed, mapped, transformed, and otherwise optimized for storage and use according to business and technical requirements.
  • Build effective data pipelines and workflows to streamline data ingestion, processing, and distribution tasks.
  • Set up and operate data workflow management tools for SQL code versioning, dependency tracing, and related tasks.
  • Load transformed data into storage and reporting structures in destinations including data warehouses, reporting systems, and analytics applications.
  • Monitor and troubleshoot issues with the data environment to maintain high availability and performance.
  • Support monitoring and observability across training datasets and model metrics, and implement diagnostic tools for metric movements.
  • Maintain effective documentation regarding data procedures, systems, and architectures to maintain clarity and enable easy collaboration.

Who You Might Be:

  • 5+ years of experience in Data Engineering or ML Infrastructure
  • Experience with large-scale data transforms to prepare graph data
  • Experience with graph databases, Spark, and Kafka pipelines
  • Experience working with Airflow and MLflow
  • Experience with storage frameworks like BQ, Parquet, and Iceberg
  • Awareness of ML models and architectures is a huge plus.
  • Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle.
  • Strong organizational & communication skills

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave  
  • Paid Volunteer time off


Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base pay range for this position is:
$190,800 - $267,100 USD

In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.

During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable.  We will not sell your personal information or disclose it to any third party for their marketing purposes.  We will delete any recording of your interview promptly after making a hiring decision.  For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors.

Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve.  Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.

Top Skills

Airflow
BQ
Graph DB
Iceberg
Kafka
MLflow
Parquet
Spark


