Earnbetter

Job Search Assistant

Data Engineer (Python/PySpark)

Dexian - DISYS • Columbia, MD 21046 • Posted 3 days ago

Boost your interview chances in seconds

Tailored resume, cover letter, and cheat sheet

Hybrid • Full-time • Senior Level

Job Highlights

Using AI ⚡ to summarize the original job post

The Data Engineer at Dexian will be responsible for supporting HIT activities in the Baltimore/Washington region, focusing on designing, developing, and maintaining a data analytics platform solution. This role involves working with a complex infrastructure including Azure Data Lake 2, Azure Data Factory, Spark, and Databricks, and creating new use cases through Databricks programming. The position requires a blend of technical proficiency, collaboration, and the ability to learn new technologies.

Responsibilities

  • Design, build, and maintain large, complex data processing pipelines that meet business requirements
  • Ensure the quality of deliverables by developing automated controls and performing unit, integration, and user acceptance testing
  • Develop scalable and re-usable frameworks for ingestion and transformation of large data sets
  • Troubleshoot and perform root cause analysis on data pipeline issues
  • Work with source system owners and business owners to incorporate business changes into data pipelines
  • Create comprehensive documentation of data workflows, processes, and infrastructure

Qualifications

Required

  • 4+ year college degree
  • 4+ years of working with data integration
  • Advanced programming skills in PySpark, Python, Spark SQL
  • Thorough understanding of Data Lakes, raw/enriched/curated layer concepts, and ETL within Azure framework
  • Proven development experience in building complex data pipelines for lakehouse/data warehouses using Agile methodology
  • Experience in architecture, design, and implementation using Databricks and Azure Data Factory

Preferred

  • Experience working with Databricks

Full Job Description

Position: Data Engineer

Location: Columbia, MD (Hybrid)

Job Type: C2H/Full-Time

Hiring Manager Notes:

Note:- Candidates should be local to DMV. Will need to be in Columbia, MD office one day per week.

This particular role will be HEAVY in Pyspark/Python language.

Interview Process - Initial screening with Manager (30 minutes); Team Interview (1 hour)

  • Tech stack required: PySpark, Python and SQL
  • Tech stack preferred: Databricks
  • Day to day:
    • New project, specific to Python Dev (Intermediate to advanced). Previously everything was all batch processing, over last couple years they have real time streaming of clinical data now w/ CEND system.
    • Things running 24/7 and shooting data directly to portal (50,000 users per month). RECENTLY created CEND (new alert system). New person will jump right into the service. Python and PySpark heavy
    • In office on Thursdays for now.
    • Needs to have good comm skills to be able to speak to non-technical people. How did project they have done help the business/stake holders?
    • Interviews will be technical as well as non-technical.

Job Description:

Job Summary:

The Data Engineer will be responsible for supporting HIT activities in the Baltimore/Washington region. The successful candidate will work to design, develop, and maintain a data analytics platform solution. Client is currently running an Azure Data Lake 2 / Azure Data Factory / Spark / Databricks infrastructure with approximately 300 production use cases that consolidate several streams of healthcare data. This role will primarily focus on the creation of new use cases through Databricks programming including communication, technical proficiency, collaboration, and willingness to learn new things. This role assists a senior data engineer to implement new ingestion pipelines and troubleshoot production issues.

Essential Duties and Responsibilities:

Include the following. Other duties may be assigned.

  • Design, build, and maintain large, complex data processing pipelines that meet business requirements
  • Ensure the quality of deliverables by developing automated controls and performing unit, integration, and user acceptance testing
  • Develop scalable and re-usable frameworks for ingestion and transformation of large data sets
  • Troubleshoot and perform root cause analysis on data pipeline issues
  • Work with source system owners and business owners to incorporate business changes into data pipelines
  • Create comprehensive documentation of data workflows, processes, and infrastructure

Qualifications:

To perform this job successfully, the incumbent must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

  • Advanced programming skills in PySpark, Python, Spark SQL
  • Thorough understanding of Data Lakes, raw/enriched/curated layer concepts, and ETL within Azure framework
  • Proven development experience in building complex data pipelines for lakehouse/data warehouses using Agile methodology
  • Experience in architecture, design, and implementation using Databricks and Azure Data Factory

Required Experience/Education:

4+ year college degree (required)

4+ years of working with data integration (required)

Experience working with Databricks (preferred)

Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. As one of the largest IT staffing companies and the 2nd largest minority-owned staffing company in the U.S., Dexian was formed in 2023 through the merger of DISYS and Signature Consultants. Combining the best elements of its core companies, Dexian's platform connects talent, technology, and organizations to produce game-changing results that help everyone achieve their ambitions and goals.

Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development and Dexian IT Solutions. Visit

to learn more.

Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.