As an Applied AI Data Scientist, you'll develop and validate AI systems to improve clinical data for oncology research, collaborate with stakeholders, and enhance data quality for impactful insights.
Reimagine the infrastructure of cancer care within a community that values integrity, inspires growth, and is uniquely positioned to create a more modern, connected oncology ecosystem. We're looking for an Applied Data Scientist to help us accomplish our mission to improve and extend lives by learning from the experience of every person with cancer. Are you ready to be the next changemaker in cancer care?
What You'll Do
At Flatiron, we're advancing the use of machine learning and generative AI to extract clinically relevant information from unstructured medical notes to create de-identified oncology research datasets. The Discovery team is helping to build these next-generation research data products, developing and applying ML & LLMs to capture a complete picture of the patient journey. The Discovery team has members spanning many different fields, from ML engineers and data scientists to product managers and oncologists.
As part of our team, you will apply existing internal and off-the-shelf external AI systems and validate AI-generated datasets that are used by clinicians and researchers to evolve cancer research, generate clinical insights, and learn from the experience of millions of people living with cancer. Engaging with a cross-functional group of stakeholders both within Discovery and across the company, you will contribute to the build-out of our datasets from scoping through to validation, productionization, and delivery.
In addition, you'll:
- Work with our clinical stakeholders to apply existing AI systems to turn raw clinical data into high-quality research data.
- Become a subject matter expert on our data and its capabilities, and collaborate closely across the team to understand data needs and provide analytical support that enhances model development and deployment.
- Work with research scientists and oncologists to validate that our team's models can be used to generate sound scientific insights, including full dataset performance analyses.
- Work closely with subject matter experts & ML researchers to define requirements for training and evaluation datasets, and maintain software pipelines for the generation of these sets.
- Provide analytic support and create custom data outputs for cross-functional teams such as our team of clinical experts.
- Interface with internal scientific & clinical stakeholders to understand what data they need to conduct high quality research.
- Work cross-functionally with software engineers to productionize, scale, and monitor our team's models.
Who You Are
You're a product-focused data scientist with creative analytical problem-solving skills, ready to tackle the problems of measuring the performance of complex datasets & the systems that build them. You're excited to learn about oncology from our clinical stakeholders and work with them to apply AI to extract nuanced clinical concepts from the medical record and validate the fitness-for-use of that data for oncology research. You're a kind, passionate, and collaborative problem-solver who seeks and gives candid feedback and values the chance to make an important impact.
- You have 3+ years of relevant working experience as an applied data scientist or similar technical data-oriented role, including relevant applied work in a graduate program. Some prior experience with ML or LLMs is preferred.
- You understand how machine learning and AI systems are measured and can analyze an existing system to understand the quality of its output, assess where improvements are needed, and communicate the impact of those improvements to stakeholders.
- You have collaborated with other technical team members in a production development environment using formal version control, Python (including data manipulation in pandas, polars or a similar framework), and SQL.
- You're impact-oriented, and care deeply about creating real change for customers, users, and ultimately patients. You choose the right, rather than the flashiest, method available to reach your goals.
- You are a clear and confident communicator who can break down complex data analyses to tell a compelling story.
- You have led cross-functional initiatives and excel at influencing decision-making without authority.
Extra Credit
- You have experience working with data in a healthcare setting.
- You have experience with the risks of bias in machine learning or with health equity research/analysis, or have worked with underrepresented groups in a clinical research setting.
- You have experience working in dbt or other ETL frameworks.
- You have experience with deep learning and traditional NLP methods.
Where You'll Work
In this hybrid role, you'll have a defined work location that includes work from home and 3 office days set by you and your team. For more information on our approach to hybrid work, please visit our "How We Work" website.
Life at Flatiron
At Flatiron Health, we offer a full range of benefits to support you and your loved ones so you can focus your working hours on improving cancer care and accelerating cancer research, and your non-working hours on everything else life has to offer:
- Work/life autonomy via flexible work hours and flexible paid time off
- Comprehensive compensation package
- 401(k) contribution to help you reach your retirement planning goals
- Financial health resources including 1:1 financial advice
- Mental well-being tools and services
- Parental benefits and policies including family-building care and generous leave
- Path to parenthood programs supporting fertility, adoption and surrogacy
- Travel support for safe healthcare services
In addition to our robust benefit offerings, visit our Life at Flatiron page to learn how we support continuous learning and celebrate inclusion and belonging in the workplace.
Preferred Primary Location: NY office
The annual pay range reflected above for this position is based on the preferred primary location of the role which is listed in the job description. Salary ranges for other locations vary from the range reflected above. Base pay offered may vary depending on job-related knowledge, skills, and experience. An annual bonus and equity may be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits, dependent on the position offered.
Top Skills
ETL
Healthcare Data
LLMs
Machine Learning
Python
SQL

