Data Engineer (Data Migration)
About Fusemachines:
Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, the United States, Canada, and the Dominican Republic and more than 250 full-time employees). Fusemachines seeks to bring its global expertise in AI to transform companies around the world.
Job Type:
This is a full-time consulting position
Roles and Responsibilities:
- Use a systematic approach to plan, create, and maintain data architectures while also keeping it aligned with business requirements.
- Formulate a set of dataset processes, obtain data and store optimized data
- Design and develop data pipelines to ingest, process and data warehouse data from sources including files, streams and databases.
- Perform and oversee tasks such as writing scripts, calling APIs, web scraping, and writing SQL queries
- Keep up-to-date with machine learning and its algorithms like the random forest, decision tree, k-means, and others.
- Use big data tools like Apache Spark to generate valuable business insights for all types of industries
- Pinpoint tasks where manual participation can be eliminated with automation.
- Assist with tasks as needed by different projects
Basic Qualifications:
- At least 6 years Total combined related work experience and completed higher education.
- At least 1 year Data acquisition, data migration and/or other data engineering work experience, including the design, development, implementation and/or support of data models, data architectures, data warehouses and/or database
- At least 5 years Additional work experience directly related to the duties of the job including:
- Experience in writing complex SQL queries.
- Have experience in a programming languages such as Java, Scala, and/or python
- Familiarity and experience with Data warehousing.
- Experience with Big query and other GCP services.
- Experience working with complex and huge sets of data.
- Experience with distributed data processing frameworks such as Apache Spark, and Hadoop.
- At least 5 years Additional work experience directly related to the duties of the job including:
- At least 1 year Data acquisition, data migration and/or other data engineering work experience, including the design, development, implementation and/or support of data models, data architectures, data warehouses and/or database
- Bachelors in Information Systems, Computer Science, Computer Engineering or Software Engineering.
Preferred Qualifications:
- At least 2 years of SQL Scripting work experience
- At least 2 years of EMR data migration import
Equal Employment Opportunity
We're proud to be an equal opportunity employer - and celebrate our employees' differences, including race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, and Veteran status. Differences make us better.