Bank of America

Senior Engineer-AI Inference

Reposted 7 Days Ago
In-Office
4 Locations
Senior level
This role involves designing and building AI inferencing capabilities, leading engineering approaches, and collaborating with teams on complex AI solutions.
The summary above was generated by AI

Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance, while allowing an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!

Position Summary:

Join a groundbreaking team at Bank of America, at the forefront of innovation in AI. We are building a next-generation Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what’s possible with AI.

This position focuses on designing, building, and serving Gen AI inferencing capabilities.

This job is responsible for defining and leading the engineering approach for complex features to deliver significant business outcomes. Key responsibilities of the job include delivering complex features and technology, enabling development efficiencies, providing technical thought leadership based on conducting multiple software implementations, and applying both depth and breadth in a number of technical competencies. Additionally, this job is accountable for end-to-end solution design and delivery.

Responsibilities:

  • Ensures that the design and engineering approach for complex features are consistent with the larger portfolio solution
  • Defines the technology tool stack for the solution and evaluates and adapts new testing tools, frameworks, and practices for team(s)
  • Enables team(s)/applications with Continuous Integration/Continuous Delivery (CI/CD) capabilities and engages with other technical stakeholders on the efficient functioning of the CI/CD pipeline
  • Guides and influences team(s) on design and best practices for high code performance (e.g., pairing, code reviews)
  • Provides end-to-end delivery of complex features, including automation, for either a single team or multiple teams, at the program level
  • Conducts research, design prototyping and other exploration activities such as evaluating new toolsets and components for release management, CI/CD, and features
  • Works with stakeholders to establish high-level solution needs and with architects for technical requirements
  • Collaborate with product teams, data analysts and data scientists to design and build solutions.
  • Design and execute implementation plans that move the platform forward strategically while ensuring the current technology stack supports current needs.
  • Manage multiple priorities, and simultaneously engage with multiple teams worldwide.
  • Be vocal and actively participate in all sessions with business stakeholders and agile teams.
  • Manage next-generation architectural decisions for the advanced analytics platform; create strategy and roadmaps, and present them to technical and non-technical leaders.
  • Coach and mentor team members.

Required qualifications:

  • Minimum 8 years of relevant experience required.
  • Experience in Model Ops, design, and software development, with proven effectiveness in delivering technology in fast-paced, demanding, industry-driven environments for AI/ML and advanced analytics.
  • Hands-on experience in Python development on Linux. Strong understanding of modern open-source data science platform architecture for storage & compute separation, interactive development workbenches, containers, and toolsets such as Jupyter, VSCode, etc.
  • Experience with data sources and vector store platforms such as Redis, Solr, Postgres DB, FAISS, Teradata, Oracle, SQL Server, Hadoop, etc.
  • Experienced in using design patterns and following best software engineering practices.
  • An understanding of fundamental algorithms and ability to optimize existing code.
  • Proficient written and verbal communication skills to support and shape the platform and clearly articulate technical designs and concepts; and to communicate effectively with all levels within the organization.
  • Experience with deploying models using vLLM/Triton Inference Server
  • Experience performance-tuning those models and deployments to provide higher throughput.
  • Experience with various inference metrics, and related monitoring and observability.
  • Experience with serving multiple tenants/clients with model endpoints with secure boundaries.
  • Experience with Authentication & Authorization, Policy as Code, Systems Integration, and Model Routing
  • Experience with model evaluation frameworks for comparing different models and their tradeoffs between efficiency and metrics.
  • Experience building RAG for various knowledge bases and document types.
  • Model monitoring: ability to collect metrics that measure things like model drift and KPIs.
  • Self-starter with excellent communication skills and the ability to challenge conventions.
  • Strong analytical skills, with the ability to solve problems, apply reason, take initiative, use judgment, and perform concurrent tasks.
  • Follows Test Driven Development practices, including continuous integration and clean code principles.

Desired Qualifications:

  • Experience developing Gen AI training and inferencing platforms with open-source models, building Gen AI model-serving capabilities, and designing RAG frameworks and MCP modules for enterprise data systems.

Skills:

  • Automation
  • Influence
  • Result Orientation
  • Stakeholder Management
  • Technical Strategy Development
  • Application Development
  • Architecture
  • Business Acumen
  • Risk Management
  • Solution Design
  • Agile Practices
  • Analytical Thinking
  • Collaboration
  • Data Management
  • Solution Delivery Process

Shift:

1st shift (United States of America)

Hours Per Week: 

40

Top Skills

FAISS
Hadoop
Jupyter
Linux
Oracle
Postgres DB
Python
Redis
Solr
SQL Server
Teradata
Triton Inference Server
vLLM
VSCode
HQ

Bank of America Charlotte, North Carolina, USA Office

100 North Tryon Street, Charlotte, NC, United States, 28202


What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus
