Data Engineer (Data Migration)

Pune, India
Full Time


About Fusemachines:

Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey, Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, the United States, Canada, and the Dominican Republic) and more than 250 full-time employees, Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Job Type: 

This is a full-time consulting position.

Roles and Responsibilities:

  • Use a systematic approach to plan, create, and maintain data architectures while keeping them aligned with business requirements.
  • Formulate dataset processes to obtain data and store it in an optimized form.
  • Design and develop data pipelines to ingest, process, and warehouse data from sources including files, streams, and databases.
  • Perform and oversee tasks such as writing scripts, calling APIs, web scraping, and writing SQL queries.
  • Keep up to date with machine learning and its algorithms, such as random forest, decision trees, k-means, and others.
  • Use big data tools like Apache Spark to generate valuable business insights for all types of industries.
  • Pinpoint tasks where manual participation can be eliminated with automation.
  • Assist with tasks as needed by different projects.

Basic Qualifications:

  • At least 6 years of total combined related work experience and completed higher education:
    • At least 1 year of data acquisition, data migration, and/or other data engineering work experience, including the design, development, implementation, and/or support of data models, data architectures, data warehouses, and/or databases.
    • At least 5 years of additional work experience directly related to the duties of the job, including:
      • Experience writing complex SQL queries.
      • Experience in a programming language such as Java, Scala, and/or Python.
      • Familiarity and experience with data warehousing.
      • Experience with BigQuery and other GCP services.
      • Experience working with large, complex data sets.
      • Experience with distributed data processing frameworks such as Apache Spark and Hadoop.


  • Bachelor's degree in Information Systems, Computer Science, Computer Engineering, or Software Engineering.

Preferred Qualifications:

  • At least 2 years of SQL scripting work experience.
  • At least 2 years of EMR data migration and import experience.

Equal Employment Opportunity

We're proud to be an equal opportunity employer - and celebrate our employees' differences, including race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, and Veteran status. Differences make us better.

