Earnbetter

Job Search Assistant

Logo

AWS Data Engineer

iSpace, Inc. • Torrance, CA 90504 • Posted 2 days ago via LinkedIn

Boost your interview chances in seconds

Tailored resume, cover letter, and cheat sheet

Hybrid • Full-time • Temporary • Senior Level

Job Highlights

Using AI ⚡ to summarize the original job post

The Senior Data Engineer at iSpace, Inc. is responsible for developing and maintaining data integration solutions using AWS Glue/EMR, Lambda, Redshift, and other tools. This role involves ensuring data quality and integrity, optimizing data integration processes, supporting business intelligence and analytics, and maintaining documentation and compliance. The position requires proficiency in PySpark, Apache Spark, and Python for data processing, as well as the ability to work less than 60% in-office in Torrance, CA.

Responsibilities

  • Develop and maintain data integration solutions using AWS Glue/EMR, Lambda, Redshift
  • Design and implement data integration workflows
  • Ensure data is accurately and efficiently extracted, transformed, and loaded into target systems
  • Validate and cleanse data to maintain high data quality
  • Ensure data quality and integrity by implementing monitoring, validation, and error handling mechanisms within data pipelines
  • Enhance performance and scalability of data integration processes on AWS cloud infrastructure
  • Identify and resolve performance bottlenecks and optimize data processing
  • Regularly review and refine integration processes to improve efficiency
  • Translate business requirements to technical specifications and code data pipelines
  • Ensure timely availability of integrated data for business intelligence and analytics
  • Collaborate with data analysts and business stakeholders to meet their data requirements
  • Document all data integration processes, workflows, and technical specifications
  • Ensure compliance with data governance policies, industry standards, and regulatory requirements

Qualifications

Required

  • Proficiency in PySpark, Apache Spark, and Python for data processing large datasets
  • Experience with AWS Glue/EMR, Lambda, Redshift
  • Knowledge of data quality and integrity best practices
  • Ability to optimize data integration processes for performance, scalability, and cost-efficiency
  • Strong problem-solving skills to identify and resolve performance bottlenecks
  • Ability to translate business requirements into technical specifications and code data pipelines
  • Understanding of data governance policies and regulatory requirements

About iSpace, Inc.

Ispace is a private lunar robotic exploration company that focuses on developing micro-robotic technology for low-cost lunar transportation services and surface exploration. The company aims to map, process, and deliver resources in cislunar space, with an emphasis on utilizing lunar water resources to enhance life on Earth and expand human presence in space.

Full Job Description

Title: Data Engineer

Location: Torrance, CA

Duration: 18 Months

Hybrid less than 60% in office

NOTE: only accepting candidates that can work on iSpace W2 (no c2c or 1099)


Job Description:


Daily Tasks Performed

  • Develop and Maintain Data Integration Solutions:
  • Design and implement data integration workflows using AWS Glue/EMR, Lambda, Redshift
  • Demonstrate proficiency in PySpark, Apache Spark and Python for data processing large datasets
  • Ensure data is accurately and efficiently extracted, transformed, and loaded into target systems.
  • Ensure Data Quality and Integrity:
  • Validate and cleanse data to maintain high data quality.
  • Ensure data quality and integrity by implementing monitoring, validation, and error handling mechanisms within data pipelines
  • Optimize Data Integration Processes:
  • Enhance performance, optimization of data workflows to meet SLAs, scalability of data integration processes and cost-efficiency on AWS cloud infrastructure.
  • Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance
  • Regularly review and refine integration processes to improve efficiency.
  • Support Business Intelligence and Analytics:
  • Translate business requirements to technical specifications and coded data pipelines
  • Ensure timely availability of integrated data for business intelligence and analytics.
  • Collaborate with data analysts and business stakeholders to meet their data requirements.
  • Maintain Documentation and Compliance:
  • Document all data integration processes, workflows, and technical & system specifications.
  • Ensure compliance with data governance policies, industry standards, and regulatory requirements.