LXT Logo

LXT

Data Engineer

Posted 4 Days Ago
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in Cairo
Mid level
Remote or Hybrid
Hiring Remotely in Cairo
Mid level
This role involves designing, developing, and maintaining data pipelines for data collection and annotation, ensuring data quality and efficiency in processes.
The summary above was generated by AI

LXT is an emerging leader in AI training data to power intelligent technology for global organizations. In partnership with an international network of contributors, LXT collects and annotates data across multiple modalities with the speed, scale, and agility required by the enterprise. Our global expertise spans over 145 countries and more than 1,000 language locales. Founded in 2010, LXT is headquartered in Toronto, Canada with a presence in the United States, UK, Egypt, India, Turkey, and Australia. The company serves customers in North America, Europe, Asia Pacific, and the Middle East. 


We are currently seeking a talented and motivated Mid-level Data Engineer to join our dynamic Data Engineering team. As a Data Engineer, you will play an important role in designing, developing, and maintaining data pipelines for our data collection, annotation, and quality evaluation projects, ensuring efficient data import and export processes. LXT technical calibers are expected to have a high level of ownership, and to be Hands-On engineers that take over the tasks until it passes the finish line.

Responsibilities:

  • Data Transformation and Integration: Perform data transformation and integration tasks to ensure data from various sources are accurately processed and made ready for annotation projects and for delivery to clients per client requirements.
  • Data Manipulation: Ensure efficient data storage, retrieval, and optimization to support data annotation workflows.
  • Data Quality Assurance: Implement data quality checks and validation processes to ensure the accuracy, consistency, and integrity of data used in annotation projects.
  • Data Pipeline Design and Development: Collaborate with the Data Engineering team to design and implement robust and scalable data pipelines for importing and exporting data used in our data annotation projects.
  • Performance Optimization: Identify and address performance bottlenecks in data pipelines to enhance the speed and efficiency of data import and export processes.
  • Automation and Process Improvement: Continuously seek opportunities to automate manual processes and improve data annotation workflows for increased productivity.
  • Collaboration and Communication: Work closely with cross-functional teams, including data annotation teams, backend developers, and project managers, to understand project requirements and provide timely data support.
  • Documentation: Maintain comprehensive documentation of data pipelines, processes, and data structures to facilitate knowledge sharing and seamless project handovers.
  • Troubleshooting and Support: Address and resolve data-related issues, providing technical support to data annotation teams when required.
  • Stay Updated on Emerging Technologies: Stay abreast of industry trends, tools, and technologies related to data engineering, and propose innovative solutions for data annotation projects.

Qualifications:

  • Bachelor’s degree in computer science, Data Engineering, or a related field.
  • Proven experience as a Data Engineer, with 4 years of hands-on experience in data pipeline design, data transformation, pipeline orchestration, and data integration, particularly for unstructured and semi-structured data.
  • Proficiency in programming languages such as Python, SQL, or Scala, and experience with data manipulation libraries and frameworks.
  • Experience with AirFlow, N8N is a plus.
  • Experience with Ruby is a big plus.
  • Knowledge and experience with machine learning projects is a big plus.
  • Solid knowledge of data storage and database management systems, including relational and NoSQL databases.
  • Familiarity with data visualization tools and techniques to facilitate data understanding and analysis.
  • Experience with AWS QuickSight and AWS Athena is a plus.
  • Solid understanding of data quality and data governance principles.
  • Familiarity with Data Lake concepts and with Apache Iceberg.
  • Experience with cloud-based data platforms, such as AWS, GCP, or Azure, is a plus.
  • Strong problem-solving skills with a keen eye for detail.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
  • A passion for data engineering and a desire to contribute to impactful data annotation projects.

Additional information:


LXT is an equal opportunity employer and ensures that no applicant is subject to less favorable treatment on the grounds of gender, gender identity, marital status, race, color, nationality, ethnicity, age, sexual orientation, socio-economic, responsibilities for dependents, or physical or mental disability. Any hiring decision is made on the basis of skills, qualifications, and experiences.

We measure our success as a business, not only by delivering great products and services and continually increasing our assets under administration and market share but also by how we positively impact people, society, and the planet.

Similar Jobs

12 Days Ago
Remote
Mid level
Mid level
Fintech • HR Tech • Payments • Financial Services
Design and maintain data pipelines while optimizing data warehouses, collaborating with analysts and scientists, and ensuring data integrity through quality measures.
Top Skills: Apache AirflowPythonSQL
5 Days Ago
Remote or Hybrid
200M-200M Annually
Mid level
200M-200M Annually
Mid level
Information Technology • Mobile • Consulting
The role involves building data lakes, developing data pipelines, ensuring data quality, collaborating with teams for analytics, and maintaining data dashboards.
Top Skills: AIAirflowDbtDockerGCPGreat ExpectationKubernetesLookerLuigiMlMongoDBNoSQLPrefectPysparkPythonScalaSQLTerraform
13 Days Ago
Remote
Senior level
Senior level
Other
The Senior Data Engineer will design and optimize scalable data pipelines and architecture, ensuring data quality and governance while leading technical solutions for analytics and AI/ML.
Top Skills: AWSAzureEltETLGCPJavaKafkaLookerPower BIPythonScalaSpark StreamingSQLTableau

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account