Job Openings Senior Data Engineer

About the job Senior Data Engineer

Senior Data Engineer

Remote · LATAM · Full-Time Contract

LATAM (Remote) Full-Time Contract ASAP English Required

ABOUT THE ROLE

We are looking for a Senior Data Engineer with deep expertise in AWS, batch processing, and real-time streaming architectures. This is a hands-on

role — you will design and own scalable data pipelines serving both high-throughput batch workloads (GBs to TBs) and low-latency streaming

systems. You will collaborate with analysts, data scientists, and business teams, and help mentor junior engineers to raise the bar across the data

platform.

Batch Processing

Design & implement batch pipelines for large-scale processing (GBs to

TBs)

Build ETL/ELT workflows using PySpark, SQL, Glue, and EMR

Optimize jobs for performance, partitioning, and cost

Manage data ingestion from DBs, files, and APIs into the data lake

Ensure data quality, consistency, and reliability

Real-Time / Streaming

Design & implement real-time pipelines using Apache Flink (Scala

preferred)

Process high-throughput streams with low-latency requirements

Implement stateful processing, windowing, and event-time handling

Handle late/out-of-order events; exactly-once / at-least-once semantics

Configure checkpointing, fault tolerance, and recovery

Integrate with Kafka or Kinesis

CORE RESPONSIBILITIES

Build and maintain data lake / lakehouse architectures on AWS

(S3-based)

Integrate multiple data sources — APIs, SaaS platforms, databases

Ensure data governance, security, and access control (IAM)

Implement monitoring, logging, and alerting for all pipelines

Collaborate with analysts, data scientists, and business teams

Mentor junior engineers and enforce best practices

TECH STACK

Python SQL Scala PySpark Apache Flink Apache Kafka AWS Glue Amazon EMR Amazon S3 AWS Lambda

Amazon Athena Amazon Kinesis Delta Lake / Iceberg

REQUIRED QUALIFICATIONS

5+ years of experience in Data Engineering

Strong skills in Python, SQL, and Scala

Hands-on AWS experience: S3, Glue, Athena, Lambda, EMR,

Kinesis, IAM

Strong batch processing experience with Spark / PySpark at scale

Proven real-time streaming experience with Apache Flink in

production

Solid understanding of batch vs streaming trade-offs and event-driven

architectures

Experience building data lakes or data warehouses in production

Experience integrating external systems via APIs

Distributed systems fundamentals: fault tolerance, partitioning,

consistency

WHAT MAKES A STRONG CANDIDATE

Has built both batch AND real-time pipelines in production — not just one or the other

Knows when to apply each paradigm: batch for cost-efficient large volumes, streaming for low-latency real-time use cases

Experience handling millions of records in batch and high event throughput in streaming systems

Strong debugging and performance optimization skills across distributed data systems

Hands-on mindset — you build things and own them end-to-end, not just design them

NICE TO HAVE

AWS certifications (Data Analytics Specialty, Solutions Architect)

Experience with Delta Lake, Apache Iceberg, or open table formats

Familiarity with dbt for transformation workflows

Experience with orchestration tools (Apache Airflow, AWS Step

Functions)

Background in IoT, telemetry, or event-heavy domains

OpenTelemetry or advanced observability experience

THIS ROLE IS NOT FOR YOU IF

You have only worked with batch OR only with streaming — this role requires both

You need a fully defined architecture and runbook before making engineering decisions

You are not comfortable owning pipelines end-to-end in production without daily oversight

You optimize for theoretical elegance before proving a real cost or reliability problem exists

You are not comfortable communicating directly in English with international teams

WHY COMMITTED STAFF

Committed Staff connects the top 1% of LATAM talent with high-growth U.S. companies. We don't just place candidates — we build committed teams

that scale. You'll get direct access to a real data engineering challenge with meaningful scope, full-time remote work aligned with U.S. time zones,

competitive LATAM-market compensation, and a growing network of senior tech professionals across Latin America.