Vectara Logo

Vectara

Machine Learning (ML) Engineer

Reposted 12 Days Ago
Remote
Hiring Remotely in US
Senior level
Remote
Hiring Remotely in US
Senior level
Develop, evaluate, and deploy AI systems and machine learning models, focusing on NLP and multimodal models. Collaborate with teams and publish research.
The summary above was generated by AI

Vectara provides a scalable platform to deploy your Enterprise AI Agents and AI Assistants with Accuracy, Security, and Explainability like no other solution. Our enterprise RAG Platform offers unparalleled Accuracy, Security, and Explainability by leveraging the strongest models for retrieval, embedding, reranking, a optimized LLM trained for quality, and advanced Hallucination Mitigation. We are the developers of the Hughes Hallucination Evaluation Model and Correction model, core to ensuring accuracy, quality, and responsible AI that is production ready. These innovations have been cited in the New York Times, Visual Capitalist, and many other leading publications. This platform has allowed us to be very successful with over 100 Enterprise clients including the likes of large US military organizations, Financial services, Healthcare, and Manufacturing.


Our founding team includes industry veterans and experts in neural information retrieval and distributed systems from Google. Join us as we pursue our mission to help the world find meaning. People at Vectara are passionate about ensuring customers take advantage of breakthroughs in applied Artificial Intelligence (AI) to solve real-world technology and business problems today. Our team is a group of unquestionable all-stars in their respective fields of computer science and business from Google, Cloudera, Splunk, MongoDB, Elastic, and more.


Job responsibilities 

  • Design, prototype, research and build AI systems for Vectara.
  • Train, evaluate and deploy ML models in the domains of Natural Language Processing, Information Retrieval, AI Agents, Large Language Models (LLMs) and Multimodal Large Language Model (MMLLMs). 
  • Improve the quality of Vectara’s AI Agents and RAG-as-a-service platform, working on features like multilinguality, self-supervised learning, agentic behavior and hallucination reduction.
  • Publish technical blogs, papers, and patents. 

Requirements:

  • BS/MS in Computer Science, Statistics, Electrical/Computer Engineering, Mathematics, or a related field. 
  • 5+/4+ years of professional work experience after BS/MS applying machine learning to real-world problems, and crafting scalable and effective ML/AI solutions.
  • Strong domain knowledge in at least one of the following: RAG, LLM, information retrieval, Multimodal LLMs.
  • Excellent programming skills in Python. Proficiency in data/ML libraries such as pandas, transformers, and torch.
  • Familiarity with the technical details of deep learning concepts, such as Transformers, Retrieval-Augmented Generation (RAG), mixture of experts (MoE).
  • Hands-on experience in training ML systems end-to-end from data curation to evaluation and deployment.

Preferred requirements:

  • PhD in Computer Science/Engineering with 1+ years of industry experience. 
  • Publications in prestigious venues such as ACL, NAACL, EMNLP, NeurIPS, ICML, ICLR as a key author. 
  • Experience as an ML engineer in an early-stage, high growth environment. 
  • Expertise in the following areas:
    • Embedding models, rerankers
    • Multimodal retrieval, question answering, and reasoning
    • Vector databases, BM25
    • Planning and reasoning in LLMs
    • Multilinguality in LLMs
    • NLG Evaluation such as hallucination detection

Location requirements:  We support remote applicants from all over the US but candidates who can come to the office 2-3 days a week in our Palo Alto office are preferred. 


Equity and Salary Range: 

Salary is just one component of Vectara’s employee compensation. Our full-time employees are also equity owners in the company, which although not an immediate cash component, can have positive impacts on long-term total compensation for each participating employee. We would be remiss if we didn’t highlight and celebrate our focus on engaging many of our employees in being economic co-owners of the business.

Vectara welcomes all. We value the collective wisdom of people from different backgrounds, experiences, abilities and perspectives.  We never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Vectara has a positive and supportive culture—we look for people who are inventive and work to be a little better every single day. We seek to be smart, humble, hardworking and, above all, curious. After all, we are on a mission to find meaning.  

Perks and Benefits:

100% paid Medical, Dental, Vision begins on your first day! Option of Health Savings Account (HSA) or Flexible Savings Account (FSA). Generous paid time off (PTO) plus paid sick time, holidays, and company rest days. Professional development and training opportunities. Company virtual happy hours and fun team building activities and more. 

Similar Jobs

36 Minutes Ago
Remote or Hybrid
172K-301K Annually
Senior level
172K-301K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead architecture and roadmap for a multi-agent AI platform, design agent orchestration, context/memory management, and productionize LLM capabilities at enterprise scale. Provide technical leadership, establish test/evaluation frameworks for non-deterministic AI systems, mentor senior engineers, and align cross-functional strategy to deploy secure, highly available agent-driven features.
Top Skills: AutogenAWSAzureDistributed SystemsGCPGoHigh-Throughput Api DesignIamJavaLangchainLlmsMicroservicesModel Context Protocol (Mcp)Python
4 Hours Ago
Remote or Hybrid
296K-424K Annually
Expert/Leader
296K-424K Annually
Expert/Leader
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead technical vision and architecture for ML-driven trajectory generation in autonomous vehicles. Build and deploy scalable training pipelines, integrate models into real-time safety-critical onboard systems, mentor senior engineers, drive cross-functional initiatives, and move solutions from research to production using simulation and large-scale datasets.
Top Skills: C++Distributed Ml PipelinesGenerative ModelsImitation LearningLarge-Scale Training InfrastructureMotion PlanningOnboard Real-Time SystemsPythonReinforcement LearningSimulation EnvironmentsTrajectory Planning
Yesterday
Remote or Hybrid
113K-193K Annually
Senior level
113K-193K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and operate scalable data pipelines and AI-ready data products from large structured and unstructured sources (OCR/images/documents). Enable production Generative AI (RAG, semantic search), ensure data quality/observability, orchestrate CI/CD and infra-as-code, and mentor engineers while collaborating with product, analytics, and compliance teams.
Top Skills: AirflowAWSAzureChartjsDatabricksDatabricksDeequDelta LakeDockerEvent HubsGCPGithub ActionsGreat ExpectationsJavaKafkaKinesisKubernetesLlmOcrPlotlyPysparkPythonRagScalaSeabornSemantic SearchSnowflakeSparkSQLTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account