webAI

Software Engineer MLX

Reposted 13 Days Ago
Remote
Hiring Remotely in USA
Mid level
Optimize machine learning models for iOS deployment, focusing on performance using C++, Python, MLX, and Metal frameworks while collaborating with teams.

About Us: 

webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI. 


About the Role: 

webAI is hiring a Software Engineer, MLX, to optimize machine learning models for iOS deployment. You’ll use your expertise in C++, Python, hardware-aware programming, and Apple’s MLX and Metal frameworks to accelerate performance on mobile devices. If you’re passionate about building efficient, on-device AI solutions, we want to hear from you.


Key Responsibilities: 

  • Optimize and deploy machine learning models on iOS devices using MLX and Metal frameworks. 
  • Develop high-performance, hardware-aware code in C++ and/or Python, focusing on vectorization, multi-threading, and system optimization. 
  • Build and optimize custom Metal kernels and computational graphs for mobile acceleration. 
  • Deploy and fine-tune PyTorch models for efficient, on-device inference. 
  • Apply quantization, tensor fusion, and batching strategies to maximize model performance and minimize size. 
  • Collaborate with AI researchers and mobile teams to deliver scalable, production-ready solutions.
  • Monitor and improve model performance across the iOS lifecycle to ensure efficiency and reliability. 

Required Skills & Qualifications: 

  • Bachelor’s, Master’s, or Ph.D. in Computer Science, Information Science, Electrical/Electronics/Telecommunications Engineering, Information Technology, or a related field, or equivalent work experience.
  • Strong background in computer architecture and hardware-aware programming. 
  • Proficiency in C++ and/or Python, with a focus on optimizing computational workloads. 
  • Experience with SIMD vectorization, multi-threading, and low-level performance tuning.
  • Hands-on experience in Machine Learning, Deep Learning, and/or Data Science. 

Preferred Qualifications: 

  • Familiarity with the iOS application lifecycle and memory optimization. 
  • Experience deploying PyTorch models on mobile devices, specifically iOS. 
  • Proficiency with Apple’s MLX framework for machine learning acceleration. 
  • Deep knowledge of Metal and Metal Performance Shaders (MPS). 
  • Experience developing custom Metal kernels for specialized operations. 
  • Strong understanding of computational graph optimization, batching strategies, on-device training, and fine-tuning.
  • Knowledge of quantization techniques, tensor operation fusion, and graph reconstruction to optimize model performance and size. 

We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following: 

  • Truth - Emphasizing transparency and honesty in every interaction and decision. 
  • Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients. 
  • Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement. 
  • Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.

Benefits: 

  • Competitive salary and performance-based incentives. 
  • Comprehensive health, dental, and vision benefits package. 
  • 401(k) Match
  • $200/month Health and Wellness Stipend
  • $400/year Continuing Education Credit
  • Free parking for in-office employees
  • Unlimited Approved PTO
  • Parental and Bereavement Leave
  • Supplemental Life Insurance 

webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.

Top Skills

C++
Metal
MLX
Python
PyTorch
