Apryse Logo

Apryse

Full Stack Data Discovery Engineer

Posted 18 Hours Ago
In-Office
3 Locations
90K-138K Annually
Mid level
In-Office
3 Locations
90K-138K Annually
Mid level
The Full Stack Data Discovery Engineer will design scalable data pipelines, develop APIs and crawlers, and analyze external data. Responsibilities include conducting searches, data governance, and collaborating with stakeholders to integrate new data sources.
The summary above was generated by AI

The Role:

We are hiring a Full-Stack Data Discovery Engineer to design and ship end to end systems that uncover technology usage across public and private ecosystems.  You will build innovative backend pipelines (APIs, crawlers, document finger printers, package-registry miners, etc.) and frontend dashboards or analysis that transform raw signals into actionable insights.  You will combine engineering skill with investigative creativity to discover patterns across ecosystems and turn them into actionable intelligence.

 

Responsibilities:

  • Own the full stack: Design, build and optimize scalable data pipelines to discover OSINT and software usage across a wide public ecosystem.
  • Pipeline development: Develop APIs, microservices, crawlers, document fingerprinting to gather data securely and efficiently. Implement backoff/caching, data normalization, and persist to SQL/NoSQL indexes
  • Data Discovery: Conduct systematic searches across the web, public databases, developer ecosystems and other platforms to identify potential external data repositories relevant to organizational objectives.
  • Metadata and Attribution Analysis: Programmatically uncover and analyze metadata associated with identified data sources to understand data structure, content, quality, and potential use cases.
  • Signals & scoring: develop heuristics/ML‑lite ranking to identify relevant artifacts , deduplicate, and assign confidence scores.
  • Data Governance: Ensure data quality, security, compliance and governance.
  • Productize discovery: build internal tools that let non‑engineers run searches, review candidates, and export leads—fast and safely
  • Documentation and Reporting: Document data structures, origins (data lineage), and quality issues. Create clear, concise reports and presentations to communicate findings and recommendations to technical and non-technical stakeholders.
  • Collaboration: Work closely with data stewards, data architects, and internal business units to define data requirements and facilitate the integration of new data sources.
  • Innovation and Scale: Continuously explore new data sources, improve attribution logic and propose ML-based enhancements to finding and classifying data.

 

Requirements:

  • Education: Bachelor's degree in Computer Science, Engineering, Library Science, Information Systems, Data Management, or a related field (Master's degree preferred).
  • Experience: 1-5 years of proven experience as a full-stack developer and data engineer. Creating the initial inception and idea of the project.
  • Technical Skills:
    • Back-end: Python, SQL, Java and Node.js
    • Front-end: Modern JS/TS + React, component libraries, auth patterns, state mgmt.
    • Data & search: schema design, dedup/near‑dup logic, Elasticsearch/OpenSearch; building usable search/triage UIs.
    • Acquisition: Scrapy/Playwright/Puppeteer; API design with rate‑limit/backoff; ethical crawling.
    • Experience with cloud-native architecture and containerization. Familiarity with metadata standards (e.g., Dublin Core, XML) and data management tools.

Assets:

  • Knowledge of data visualization tools (e.g. Power BI, Tableau) to present findings.
  • Experience building internal platforms/tools used by end users or GTM teams.

Soft Skills:

  • Exceptional attention to detail and strong analytical thinking skills.
  • Excellent written and verbal communication skills, with the ability to translate technical findings into business insights.
  • Strong problem-solving aptitude and the ability to work independently and collaboratively in a fast-paced environment.

 

Benefits:

  • Competitive salary commensurate with experience & qualifications.
  • A comprehensive extended benefits package including health, dental and vision for you and your family.
  • A great team environment and resources, supporting you to do the best work of your life and providing unlimited career growth potential.
  • Annual recurring WFH allowance for you to purchase items you need for your home office.
  • On going support for learning development so you can master your craft.
  • Work with the hardware you're most comfortable with (Windows or Mac).
  • Diverse and inclusive workplace where we all learn from each other.
  • Excellent work-life balance with a flexible remote work environment.

 

 

Benefits:

  • Competitive salary commensurate with experience and qualifications. 
  • A comprehensive extended benefits package including health, dental and vision for you and your family, with company paid offerings.
  • 401K savings program with company match.
  • Generous paid time off (PTO) is offered to support the ability to rest and recharge.
  • A great team environment and resources, supporting you to do the best work of your life and providing unlimited career growth potential.
  • Highly autonomous and entrepreneurial environment.
  • Annual recurring WFH allowance for you to purchase items you need for your home office.
  • Ongoing support for learning development so you can master your craft.
  • Work with the hardware you're most comfortable with (Windows or Mac).
  • Diverse and inclusive workplace where we all learn from each other.


Company Description

As the industry-leading provider of document software development (SDK) technology powering everything from traditional desktop software to innovative web and mobile applications, at Apryse we are committed to delivering cutting-edge technology solutions that empower our clients to achieve their goals. With a broad international portfolio of combined companies, products, and leading technologies, we are actively changing the way the world works with documents to make work better and life simpler.

Customers like IBM, Autodesk, DocuSign, Boeing, Microsoft (and many more!) come to us to realize their web and mobile strategies for document management, editing, and collaboration as the #1-ranked commercial document SDK of choice for companies worldwide. As a result, you can find our document technology in thousands of solutions, including those of household names, used by millions across virtually every industry. Our XODO app alone has 25M unique installs -- and counting -- and the highest ratings among PDF productivity apps on the largest online app marketplaces.

Internally, we foster an atmosphere of opportunity, growth, and success for every individual amidst an exciting and challenging entrepreneurial culture. Career progression is based on merit, not tenure. Every member of our vibrant team is empowered to be a contributor, innovator, and successful leader.

Ready to join our team?

If you are interested in helping Apryse deliver on its commitments and taking your career to the next level, we invite you to apply online now. Additionally, we view the above section as a guide, not a checklist. We welcome diverse and non-traditional backgrounds and encourage you to apply even if you do not have every requirement listed.

The compensation for this position is commensurate upon experience, with a range between $90,000.00-$138,000.00 USD in on target earnings.

We are committed to a work environment that is inclusive to all and free of discrimination. It is our policy to be an equal opportunity employer without regard to race, color, religion, sex, age, national origin, disability, sexual orientation, gender identity or expression, genetic predisposition or carrier status, veteran status, citizenship status or any other factors prohibited by law. Apryse will provide reasonable accommodations for qualified individuals.

Top Skills

Elasticsearch
Java
Modern Js/Ts
Node.js
Opensearch
Playwright
Puppeteer
Python
React
Scrapy
SQL

Similar Jobs at Apryse

7 Days Ago
In-Office or Remote
6 Locations
100K-115K Annually
Senior level
100K-115K Annually
Senior level
Productivity • Software • App development • Automation
The Senior Business Analyst will analyze enterprise workflows, define business requirements, and manage program delivery, ensuring alignment with strategy and governance.
Top Skills: Crm PlatformsFinance PlatformsHr Platforms

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account