Bank of America Logo

Bank of America

Software Engineer III -Gen AI Inferencing

Posted 4 Hours Ago
Be an Early Applicant
In-Office
4 Locations
Senior level
In-Office
4 Locations
Senior level
Develop and deliver AI capabilities, focusing on design, build, and operation of reusable toolkits for Gen AI. Collaborate with teams to achieve business goals and ensure compliance with requirements.
The summary above was generated by AI

Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
 

Position Summary

Join a groundbreaking team at Bank of America, at the forefront of innovation in AI.  We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what’s possible with AI.

This position is focused on design, build, and operate of reusable toolkits for Gen AI RAG capabilities.

This job is responsible for developing and delivering complex requirements to accomplish business goals. Key responsibilities of the job include ensuring that software is developed to meet functional, non-functional and compliance requirements, and solutions are well designed with maintainability/ease of integration and testing built-in from the outset. Job expectations include a strong knowledge of development and testing practices common to the industry and design and architectural patterns.

Responsibilities:

  • Codes solutions and unit test to deliver a requirement/story per the defined acceptance criteria and compliance requirements
  • Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
  • Mentors other software engineers and coach team on Continuous Integration and Continuous Development (CI-CD) practices and automating tool stack
  • Executes story refinement, definition of requirements, and estimating work necessary to realize a story through the delivery lifecycle
  • Performs spike/proof of concept as necessary to mitigate risk or implement new ideas
  • Automates manual release activities
  • Designs, develops, and maintains automated test suites (integration, regression, performance)
  • Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
  • Manage multiple priorities, and simultaneously engage with multiple teams.
  • Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle.
  • Be vocal and actively participate in all session with business stakeholders and agile teams.
  • Collaborate with product teams, data analysts and data scientists to design and build solutions.

Required qualifications:

  • 5+ years OOP in Python/Scala/Java programming experience with expert level development skills
  • Experience with AI/ML/GenAI Lifecycle Management and Development and its Ecosystem. Hands on experience building frameworks using MLOps, Fine – Tuning techniques, Inference Frameworks
  • Experience with deploying models using vLLM/Triton Inference Server in containers in production with automation. Performs Continuous Integration and Continuous Development (CI-CD) activities. Performance Tuning those models and deployment to provide higher throughput.
  • Track record of maintaining large scale Python/Unix based systems.
  • Hands on experience and knowledge generative AI RAG process for various use cases, including chunking, embedding, retrieval, reranking and summarization.
  • Hands-on experience in application development in one or more areas MongoDB, Redis, Angular/React Frameworks, Containerization, Building API based application leveraging FAST API services, JWT Integration, API Gateway
  • Develop efficient utilities, automation frameworks, data science platforms that can be utilized across multiple Data Science teams for AI/ML and GenAI work.
  • Working in large sized teams that collaboratively develop on a shared multi-repo codebase using IDEs (e.g. VS Code rather than Jupyter Notebooks), Continuous Integration (CI), Continuous Deployment (CD) and Continuous Testing
  • Strong automation, scripting, and Python development skills. Hands-on DevOps experience with one or more of the following enterprise development tools: Version Control (GIT/Bitbucket), Build Orchestration (Jenkins), Code Quality (SonarQube and pytest Unit Testing), Artifact Management (Artifactory) and Deployment (Ansible)

Desired Qualifications

  • Experience building & deploying Gen AI inferencing platform with open-source toolsets, building inferencing & servicing capabilities (AI Gateway, Policy store, Observability) for RAG/ MCP use cases etc.
  • Hands on experience on driving and maintaining a culture of quality, innovation, and experimentation.
  • Research on new tools and capabilities for better UI and UX for advanced analytics platform, quick prototype and demonstrate the features and capabilities, and participate on various user forums.

    Skills:

    • Application Development
    • Automation
    • Influence
    • Solution Design
    • Technical Strategy Development
    • Architecture
    • Business Acumen
    • DevOps Practices
    • Result Orientation
    • Solution Delivery Process
    • Analytical Thinking
    • Collaboration
    • Data Management
    • Risk Management
    • Test Engineering

    Shift:

    1st shift (United States of America)

    Hours Per Week: 

    40

    Top Skills

    Angular
    Ansible
    Artifactory
    Fast Api
    Git
    Java
    Jenkins
    Jwt
    Mlops
    MongoDB
    Pytest
    Python
    React
    Redis
    Scala
    Sonarqube
    Triton Inference Server
    Vllm
    HQ

    Bank of America Charlotte, North Carolina, USA Office

    100 North Tryon Street, Charlotte, NC, United States, 28202

    Similar Jobs

    4 Hours Ago
    In-Office
    4 Locations
    Senior level
    Senior level
    Big Data • Fintech • Mobile • Payments • Financial Services • Data Privacy
    The Software Engineer III will design and build reusable toolkits for Gen AI capabilities, ensuring compliance and maintainability while automating processes and mentoring others.
    Top Skills: AngularAnsibleApi GatewayArtifactoryBitbucketContainerizationFast ApiGitJavaJenkinsJwtMlopsMongoDBPytestPythonReactRedisScalaSonarqube
    57 Minutes Ago
    In-Office
    4 Locations
    75K-110K
    Mid level
    75K-110K
    Mid level
    Cloud • Information Technology • Machine Learning
    The FP&A Analyst will partner with sales to analyze financial deals, evaluate pricing strategies, and guide contract negotiations.
    Top Skills: Accounting SystemsFinancial ModelingFinancial SystemsP&L Analysis
    2 Hours Ago
    In-Office
    3 Locations
    143K-210K
    Senior level
    143K-210K
    Senior level
    Cloud • Information Technology • Machine Learning
    The Business Systems Engineer will enhance data center operations by managing asset lifecycle, automating infrastructure systems, and integrating workflows for efficiency.
    Top Skills: Dcim SystemsJavaScriptNode.jsProcorePythonReactTypescript

    What you need to know about the Charlotte Tech Scene

    Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

    Key Facts About Charlotte Tech

    • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
    • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
    • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
    • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
    • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
    • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account