Sayari Logo

Sayari

Senior Data Engineer

Posted 25 Days Ago
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Build and maintain scalable ETL pipelines using Python, Spark, and Airflow; collaborate with AI/ML and Product teams to deliver AI-native data products; identify and resolve ETL bottlenecks; ensure code quality through reviews and tests; own sprint deliverables and contribute to roadmap planning and major epics.
The summary above was generated by AI
About Sayari: 

Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

POSITION DESCRIPTION

As a Data Engineer at Sayari, you will be the engine behind the world’s most comprehensive commercial world model. You will join a high-autonomy team responsible for building and scaling the complex orchestration systems that transform billions of primary-source records into actionable intelligence. This is a role for a "builder" who respects the complexity of large-scale ETL and graph databases and is "PhD-curious" about the future of AI-native data products and modern orchestration.

JOB RESPONSIBILITIES
  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products.
  • Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase.
  • Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions.
  • Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance.
  • Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility.
SKILLS & EXPERIENCE

Required

  • 5 or more years of production data engineering experience, with clear ownership of systems you built and operated end to end
  • Strong Python, with meaningful experience in a JVM language (Scala preferred) or willingness to ramp quickly
  • Hands-on Snowflake experience, or equivalent depth in BigQuery or Redshift with demonstrated ability to transfer
  • Experience deploying and operating AI or ML applications in production, including output validation, monitoring, and cost management at scale
  • Orchestration experience with Apache Airflow or a comparable workflow tool
  • Track record of operating production systems reliably, with comfort navigating failure, monitoring, and recovery

Preferred

  • Experience with Spark on Dataproc Serverless or other serverless Spark environments
  • Familiarity with Kubernetes for deployment
  • Experience with data quality tooling such as deequ, Great Expectations, or equivalent
  • GCP experience (BigQuery, Dataproc, Cloud Storage)
  • Experience leading or contributing to a data warehouse migration
  • Background in team mergers or migrating a team onto a new operating process

The target base salary for this position is $140,000-$160,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$140,000$160,000 USD

Similar Jobs

An Hour Ago
In-Office or Remote
CA, USA
168K-297K Annually
Senior level
168K-297K Annually
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Design and maintain data architecture and pipelines to support compliance and risk teams. Build and optimize data models, standardize metrics, and create data dictionaries. Implement data quality, lineage monitoring, AI-driven agents for false-positive reduction and automation, and participate in on-call rotations to ensure SLAs are met.
Top Skills: AirflowDatabricksDbtGitOmniPrefectPythonSnowflakeSQLTerraform
Yesterday
Remote or Hybrid
CA, USA
168K-297K Annually
Senior level
168K-297K Annually
Senior level
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Lead design and optimization of data models and pipelines for compliance and risk; standardize metrics and documentation; build data quality, lineage, and monitoring (including AI agents for automation); manage ETL scheduling, on-call pipeline support, and collaborate with product and non-technical partners to translate business needs into automated, production-ready data solutions.
Top Skills: AirflowDatabricksDbtGitOmniPrefectPythonSnowflakeSQLTerraform
4 Days Ago
Remote or Hybrid
US
135K-155K Annually
Senior level
135K-155K Annually
Senior level
Professional Services • Software
Lead architecture and buildout of a new graph-backed enterprise data platform: design ingestion, graph and relational storage, entity resolution pipelines, temporal models, ETL/ELT pipelines, governance, APIs, and production connectors. Ship scalable graph data models, traversal queries, and platform roadmap while enabling observability, security, and containerized deployments.
Top Skills: AirflowAzureCypherDagsterDbtDockerGremlinHelmJavaKubernetesPythonSalesforceServicenowSparqlSQL

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account