iManage Logo

iManage

NLP Data Engineering Intern

Posted An Hour Ago
Be an Early Applicant
Hybrid
Chicago, IL
Internship
Hybrid
Chicago, IL
Internship
As an NLP Data Engineering Intern, you will transform text data into insights, design pipelines, and collaborate with teams to support AI applications.
The summary above was generated by AI

What is iManage U?
iManage U provides students the chance to experience a dynamic, rapid growth technology company firsthand. iManage will provide a structured program which delivers project-based activities, improved knowledge of business fundamentals, tackling complex problem solving, collaboration, team building, and some fun experiences along the way!  This year, our paid internship program will kick-off on Monday, June 8th and will run through Thursday, August 13th.  
This internship will be based out of our downtown Chicago office, with activities requiring in-person presence.
Goals of the Program:

  • iM Making An Impact: Leave your mark on your team by owning and completing assigned projects
  • iM A Mentee: Learn from teammates across departments & gain perspectives from a diversity of people
  • iM A Connector: Meet & connect with as many interns and iManage employees as possible
  • iM Inspired: Learn from our leadership team and ask questions during our lunch and learns
  • iM Social: Enjoy intern events, and everything iManage has to offer this summer

Being an NLP Data Engineering intern at iManage means…
You are excited about transforming unstructured text into meaningful insights that power AI and machine learning solutions. You thrive at the intersection of data engineering and natural language processing and are eager to contribute to the pipelines and datasets that fuel generative AI applications, agentic systems, and other NLP-driven capabilities across iManage.
As an NLP Data Engineering Intern on the AI and knowledge engineering team, you will get hands-on experience designing, building, and optimizing text data pipelines that power AI/ML and Generative AI solutions for our customers. You’ll collaborate with knowledge engineering, applied AI, and product teams to help prepare, enrich, and integrate document data. Your contributions will be essential to enabling intelligent, AI-powered features across the iManage platform.
iM Responsible For…

  • Performing exploratory analyses on large text corpora and developing preprocessing pipelines for training and evaluation data
  • Supporting the design of automated workflows for text normalization, deduplication, language identification, PII redaction, and metadata enrichment
  • Assisting with building automated data validation processes to ensure accuracy and consistency of NLP datasets
  • Contributing to dataset curation, prompt dataset preparation, labeling coordination, and text quality validation to support model fine-tuning, semantic search, and Gen AI evaluations
  • Partnering with the Applied AI team to understand data requirements and help build data interfaces for machine learning systems
  • Learning and applying data lineage best practices and data privacy, security, and governance principles
  • Maintaining highest quality standards through processes that identify and correct mistakes and inconsistencies
iM Qualified Because I have…
  • Current enrollment in a Master’s, or PhD program in Computer Science, Data Engineering, Data Science, Applied Mathematics, Computational Linguistics, or a related quantitative field
  • Proficiency in Python and experience using it to extract, structure, classify, and analyze text data
  • Foundational understanding of NLP concepts such as tokenization, embeddings, and semantic search
  • Familiarity with standard NLP libraries such as SpaCy, HuggingFace Datasets, or NLTK
  • Solid knowledge of data structures, algorithms, and statistics
  • Proficiency with Git and collaborative development workflows
  • A passion to learn and improve, and an eagerness to share knowledge with colleagues
  • Problem-solving, creativity, curiosity, and a collaborative mindset
Bonus points if you have..
  • Exposure to Microsoft Azure services such as Fabric, ADLS, AI Foundry, or Azure ML
  • Experience with data pipeline orchestration or workflow automation tools like Databricks
  • Familiarity with knowledge graphs or semantic data modeling

Don't meet every qualification listed above? Studies show that women and people of color are less likely to apply to jobs unless they meet all qualifications. At iManage, we are committed to building a diverse and inclusive environment, and encourage everyone to show up as their full authentic selves. We welcome those that come with a growth mindset and a hunger for learning; so, if you are excited about this role but your past experience doesn't align perfectly with every qualification we encourage you to apply anyways!  
About iManage
iManage is dedicated to Making Knowledge WorkTM.  Over one million professionals across 65+ countries rely on our intelligent, cloud-enabled, secure knowledge work platform to uncover and activate the knowledge that exists inside their business content and communications.   
We are continuously innovating to solve the most complex professional challenges and enable better business outcomes; Our work is not always easy but it is ambitious and rewarding.  
So we’re looking for people who love a challenge. People who are happiest when they’re solving problems and collaborating with the industry’s best and brightest. That’s the iManage way. It’s how we do things that might appear impossible. How we develop our employees’ strengths and unlock their potential. How we find meaning in everything we do.  
Whoever you are, whatever you do, however you work. Make it mean something at iManage.

Learn more at: www.imanage.com     

Please see our privacy statement for more information on how we handle your personal data: https://imanage.com/privacy-policy/     

 #LI-DNI

Top Skills

Databricks
Huggingface Datasets
Azure
Nltk
Python
Spacy

Similar Jobs at iManage

Yesterday
Hybrid
103K-135K Annually
Mid level
103K-135K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
As an Endpoint Administrator, you will design, implement, and manage Intune solutions for endpoint management, ensuring device security and compliance while supporting users and deploying applications.
Top Skills: Azure AdConfiguration ManagerMicrosoft Defender For EndpointMicrosoft IntunePowershell
3 Days Ago
Hybrid
87K-130K Annually
Junior
87K-130K Annually
Junior
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
The Contracts Specialist will manage commercial contracts, optimize contract management platforms, and collaborate with other departments to streamline processes.
Top Skills: AsanaIronclad
3 Days Ago
Hybrid
103K-159K Annually
Mid level
103K-159K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software
As a Site Reliability Engineer, you'll develop resilient cloud platforms, automate processes, engage in cross-team collaboration, and oversee incident management, focusing on scaling and security.
Top Skills: AksAzureBashChefCi/CdDockerElkGoGrafanaJavaKubernetesPowershellPrometheusPythonRubyTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account