Robots & Pencils

AI Engineer

Posted Yesterday
In-Office or Remote
Hiring Remotely in Calgary, AB
Senior level
Develop and optimize LLM applications, monitor and enhance production AI systems, and ensure reliability and scalability while collaborating across teams.

At Robots & Pencils, we build meaningful, scalable digital products by blending strategy, design, and engineering. We are seeking a Level 4 AI Engineer to build production LLM applications for an enterprise client as part of a long-term, delivery-focused engagement.

You will own the AI stack end-to-end, including RAG pipelines, prompt engineering, and evaluation frameworks. This is a hands-on role: you will write production code, tune prompts, build evaluation and observability systems, and iterate based on real user feedback.

There is a working proof of concept in place. Your responsibility is to make it production-ready and extend it with intelligent, reliable features that operate at enterprise scale.


What You’ll Do

AI & LLM Application Delivery

· Build, optimize, and evolve RAG pipelines, including retrieval strategies, chunking, and re-ranking

· Develop prompts and guardrails for domain-specific LLM applications

· Implement hallucination detection, mitigation, and fact-checking mechanisms

· Build embeddings-based search and recommendation features

· Validate AI features with real users and iterate based on qualitative and quantitative feedback

Evaluation, Monitoring & Reliability

· Set up and maintain LLM evaluation frameworks to measure quality, relevance, and reliability

· Implement observability and monitoring for production AI systems

· Monitor live AI systems and resolve quality, accuracy, and performance issues

· Continuously improve AI outputs based on evaluation data and user behavior

Platform & System Integration

· Work closely with product and engineering teams to integrate AI into user-facing features

· Build and maintain backend services in Python

· Integrate with vector databases to support retrieval and semantic search workflows

· Ensure AI solutions meet enterprise requirements for security, scalability, and maintainability

Delivery & Collaboration

· Collaborate with cross-functional partners across product, engineering, and design

· Operate effectively in environments with evolving requirements and ambiguity

· Communicate clearly with technical and non-technical stakeholders

· Take ownership of delivery outcomes from experimentation through production


Required Skills & Experience

· 8+ years of professional software engineering experience, with 4+ years focused on applied AI/ML or data-driven systems in production environments

· 3+ years building and operating production AI systems

· Strong hands-on experience with LLM applications, including RAG, prompt engineering, and evaluation

· Experience implementing hallucination detection and mitigation techniques

· Proficiency in Python

· Experience working with vector databases (Weaviate, Pinecone, or similar)

· Experience with LLM evaluation frameworks (Langfuse, Weights & Biases, or custom solutions)

· Production experience using Claude and/or GPT APIs

· Strong understanding of embeddings and semantic search

· Comfortable working with ambiguity and iterating on unclear problems

· Bachelor's degree in Computer Science, Engineering, Data Science, or a related technical field, or equivalent practical experience

· Advanced degree (Master’s or PhD) in a relevant field


Nice to Have

· Experience with Azure AI services, including Azure OpenAI and Cognitive Services

· Experience with document processing (PDF extraction, OCR)

· Exposure to audio or speech processing (e.g., Whisper or similar tools)

· Experience building enterprise B2B software

· Experience with ML classification and model training


Tech Stack

· LLMs: Claude (Anthropic), Azure OpenAI

· Vector Database: Weaviate

· Backend: Python

· Infrastructure: Azure

· Evaluation & Observability: Langfuse or similar


How You Work

· You are hands-on and delivery-focused, writing code and owning outcomes

· You balance speed with quality in production environments

· You communicate clearly and collaborate effectively across disciplines

· You take ownership of ambiguous problems and drive them to resolution

· You prioritize reliability, maintainability, and real-world impact


Why Robots & Pencils

· Real production impact, not a POC that sits on a shelf

· Exposure to the full AI lifecycle: RAG, LLM applications, evaluation, classification, and monitoring

· End-to-end ownership of the AI stack and technical decision-making

· A small, senior team with direct access to enterprise clients


Top Skills

Azure OpenAI
Claude
Langfuse
LLMs
Python
Weaviate
