Upwork Logo

Upwork

Sr Lead Machine Learning Engineer

Posted 22 Days Ago
Easy Apply
Remote or Hybrid
Hiring Remotely in USA
195K-308K Annually
Senior level
Easy Apply
Remote or Hybrid
Hiring Remotely in USA
195K-308K Annually
Senior level
Design and implement evaluation frameworks for AI systems, focusing on human+AI collaboration. Lead statistical analysis and benchmarking efforts, working cross-functionally to enhance AI model performance.
The summary above was generated by AI

Upwork ($UPWK) is the world’s human and AI-powered work marketplace that connects businesses with highly skilled, AI-enabled independent talent from across the globe. From entrepreneurs to Fortune 100 enterprises, companies rely on Upwork’s trusted platform and its mindful AI companion, Uma, to find and hire expert talent, leverage AI-powered work solutions, and drive business transformation. With on-demand access to professionals spanning more than 10,000 skills across AI & machine learning, software development, sales & marketing, customer support, finance & accounting, and more, Upwork enables businesses of all sizes to scale, innovate, and build agile teams for the age of AI and beyond.

Upwork’s platform has facilitated more than $25 billion in economic opportunity for talent around the world. Learn more at Upwork.com and follow us on LinkedIn, Facebook, Instagram, TikTok, and X.

We're looking for a Sr Lead MLE/Applied Scientist to define how success is measured for AI agents performing real-world tasks. This role is at the forefront of building trust and quality into agentic systems by crafting rigorous, reproducible evaluation frameworks that shape what we ship. You’ll work cross-functionally to evaluate human+AI collaboration, assess outcomes beyond accuracy metrics, and uncover what’s truly working for freelancers and clients. Join us in revolutionizing agent evaluation and making a measurable impact on AI systems that power the future of work.

Responsibilities
  • Design and implement comprehensive evaluation frameworks that reflect real-world task success for agentic systems, with a focus on human+AI collaboration outcomes
  • Build benchmarking pipelines that capture nuanced success indicators including trust calibration, intervention frequency, and agent handoff quality
  • Lead development of observability tools and instrumentation for analyzing agent behavior in production
  • Translate complex qualitative and quantitative signals into actionable insights that inform model iteration and product prioritization
  • Collaborate with researchers, engineers, and product teams to align evaluation methodologies with business and user goals
  • Own benchmarking infrastructure that enables reproducible, scalable evaluation across AI initiatives
  • Champion rigorous experimental design and statistical analysis across teams to ensure consistent and meaningful measurement standards
What it takes to catch our eye
  • Proven experience designing evaluation systems for agentic or LLM-based AI, ideally in complex, interactive or open-ended environments
  • Deep expertise in statistical experimentation, benchmark creation, and human-AI interaction assessment
  • Fluency in building data pipelines and tooling using Python, SQL, and distributed data processing frameworks
  • Demonstrated ability to influence product and model roadmaps through evaluation insights and performance measurement
  • Adaptive-level proficiency in integrating AI tools into technical workflows for analysis, experimentation, and observability refinement
Come change how the world works.

At Upwork, you’ll shape the future of work for a global, remote-first workforce, creating economic opportunities for professionals worldwide. While we have a physical office in Palo Alto, we currently hire full-time employees in 34 U.S. states, making it easier than ever to join our mission from wherever you call home.

Our culture is built on trust, risk-taking, customer focus, and excellence, all in service of our core mission: to create economic opportunities so people have better lives. We embrace authenticity and inclusion, encouraging everyone to bring their whole selves to work. Personal and professional growth is a priority here, supported through development programs, mentorship, and our Upwork Belonging Communities.

We’re proud to offer benefits that go beyond the basics, including comprehensive medical coverage for you and your family, unlimited PTO, a 401(k) plan with matching, 12 weeks of paid parental leave, and an Employee Stock Purchase Plan. Visit our Life at Upwork page to learn more about our values, working principles, and the overall employee experience.

Ready to help shape the future of work? Check out our Careers page to learn more about opportunities at Upwork.

Upwork is an Equal Opportunity Employer committed to recruiting and retaining a diverse and inclusive workforce. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or other legally protected characteristics under federal, state, or local law.

Please note that a criminal background check may be required once a conditional job offer is made. Qualified applicants with arrest or conviction records will be considered in accordance with applicable law, including the California Fair Chance Act and local Fair Chance ordinances.

The annual base salary range for this position  is displayed below. The range displayed reflects the minimum and maximum salary for this position, and individual base pay will depend on your skills, qualifications, experience, and location. Additionally, this position is eligible for the annual bonus plan or sales incentive plan and eligibility to participate in our long term equity incentive program.

Annual Base Compensation
$195,000$308,000 USD

Upwork is an Equal Opportunity Employer committed to recruiting and retaining a diverse and inclusive workforce. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or other legally protected characteristics under federal, state, or local law.

Please note that a criminal background check may be required once a conditional job offer is made. Qualified applicants with arrest or conviction records will be considered in accordance with applicable law, including the California Fair Chance Act and local Fair Chance ordinances. The Company is committed to conducting an individualized assessment and giving all individuals a fair opportunity to provide relevant information or context before making any final employment decision.

To learn more about how Upwork processes and protects your personal information as part of the application process, please review our Global Job Applicant Privacy Notice

Top Skills

Distributed Data Processing Frameworks
Python
SQL

Similar Jobs

30 Minutes Ago
In-Office or Remote
2 Locations
142K-195K Annually
Senior level
142K-195K Annually
Senior level
Artificial Intelligence • Machine Learning
Seeking a Senior Sales Engineer to act as a technical advisor for enterprise clients, delivering AI solutions, conducting demos, and supporting the sales cycle.
Top Skills: AIApi IntegrationAWSAzureCloud ComputingGCPGraphQLLarge Language ModelsOauth 2.0RestRetrieval-Augmented GenerationSaaS
30 Minutes Ago
Remote
2 Locations
170K-234K Annually
Senior level
170K-234K Annually
Senior level
Artificial Intelligence • Machine Learning
The Senior Sales Engineer will act as a technical advisor to enterprise clients, leading technical sales cycles, delivering demos and POCs, and advising on AI adoption strategies.
Top Skills: Api IntegrationAzureCloud Computing (AwsEmotion AiGcp)Generative AiKnowledge AiLarge Language Models (Llms)Retrieval-Augmented Generation (Rag)
30 Minutes Ago
Remote
TX, USA
106K-146K Annually
Senior level
106K-146K Annually
Senior level
Artificial Intelligence • Machine Learning
The Manager of Customer Success will oversee deployment and adoption of AI solutions, manage customer relationships, ensure performance tracking, and collaborate across functions to enhance service delivery.
Top Skills: AdtechAWSAzureCdpGCPHugging FaceMartechOpenaiTensorFlow

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account