BentoML Logo

BentoML

Forward Deployed Engineer

Reposted 25 Days Ago
Remote
3 Locations
Mid level
Remote
3 Locations
Mid level
As a Forward Deployed Engineer at BentoML, you'll design and implement full-stack AI solutions, manage customer relationships, and influence product direction while working with cutting-edge AI technologies.
The summary above was generated by AI
About BentoML

BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance in production. Our portfolio includes both open source and commercial products, and our goal is to help each team build its own competitive advantage through AI.

Role

Forward Deployed Engineers sit at the intersection of core engineering, product strategy, and customer success. You’ll partner directly with customers to understand real-world problems, then design, build, and launch production-ready AI solutions on the Bento platform. You’ll own engagements from first conversation to production rollout, while feeding insights back into our core product and engineering.

Example projects:

  • https://bentoml.com/blog/comfy-pack-serving-comfyui-workflows-as-apis

  • https://bentoml.com/blog/neurolabs-faster-time-to-market-and-save-cost-with-bentoml

  • https://bentoml.com/blog/accelerating-ai-innovation-at-yext-with-bentoml

Responsibilities
  • Own projects end-to-end. Scope, architect, and ship full-stack AI/ML systems using Python (and any language that gets the job done). You’ll collaborate closely with engineering teams, balancing speed with reliability and security.

  • Build innovative solutions. Optimize and deploy state-of-the-art models, tune inference pipelines, and push the boundaries of performance, cost, and scale.

  • Own customer relationships. Serve as the primary technical point-of-contact, guiding customers from prototype to production and ensuring long-term success.

  • Act as a product manager in the field. Capture qualitative and quantitative feedback, synthesize themes, and translate them into clear product requirements that influence our roadmap.

  • Publish and share your work. Write blog posts, open-source examples, and conference talks that expand what’s possible with AI inference.

Qualifications
  • Demonstrated skill building ML pipelines, model inference, and AI agents.

  • Ability to navigate ambiguity, make pragmatic technical trade-offs, and drive projects to completion with minimal oversight.

  • Excellent written and verbal communication skills; you comfortably translate deep technical topics for diverse audiences.

  • Entrepreneurial mindset; you thrive in fast-moving environments and enjoy wearing multiple hats.

Why join us
  • Global impact. Your projects power customer products used by millions worldwide.

  • Cutting-edge AI. Work daily with the latest open-source and proprietary models, shaping how they’re applied in production.

  • Influence product direction. Your field insights directly steer both our open-source libraries and commercial platform.

  • Remote-first culture. Work where you’re happiest, supported by asynchronous processes and quarterly in-person meet-ups.

  • Showcase your expertise. We celebrate and amplify your blog posts, sample repos, and conference talks to the broader community.

Top Skills

Python

Similar Jobs

Yesterday
In-Office or Remote
3 Locations
100K-150K Annually
Junior
100K-150K Annually
Junior
Artificial Intelligence • Digital Media
The Forward Deployed Engineer will develop and deploy generative image technology for enterprise customers, focusing on prototyping and customer engagement.
Top Skills: APIsOpen Source ModelsPython
3 Days Ago
In-Office or Remote
Montréal, QC, CAN
6-10 Annually
Senior level
6-10 Annually
Senior level
Information Technology • Security • Big Data Analytics
Collaborate with clients to deploy data solutions, enhance data infrastructure, ensure compliance, and solve technical challenges while supporting data strategy integration.
Top Skills: PythonSQLTypescript
9 Days Ago
Remote
Canada
Senior level
Senior level
Software
As a Forward Deployed Engineer at Vanta, collaborate with customers to design and implement integrations, troubleshoot issues, and enhance product adoption, leveraging strong software engineering skills over 5-10+ years.
Top Skills: APIsGoPythonSdksTypescript

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account