San Juan, Puerto Rico
Data Engineer
Job Description:
***Positions posted by El Comeback are done on behalf of companies that we support in their search for candidates.***
Our client, INVID, is looking for a Data Engineer.
Job Title: Data Engineer
Schedule: Monday to Friday
Work Modality: Hybrid
Essential Duties and Responsibilities:
- Build labeling pipelines that join behavioral events to outcome data (sanctions designations, flag changes, detentions)
- Implement proxy labeling strategies that create training signal from observable outcomes
- Build weak supervision infrastructure to combine multiple noisy labeling rules
- Create and maintain ML training datasets at scale
- Build data validation and quality monitoring systems
- Implement versioning for reproducible model training
- Integrate LRIT (Long-Range Identification and Tracking) position data for prediction validation
- Build pipelines that compare predicted locations against actual LRIT reports
- Create feedback loops that improve model accuracy over time
- Scale data infrastructure as models and data sources grow
Experience:
- 4+ years of data engineering experience
- Strong SQL skills, including complex joins across large datasets
- Experience with Spark, Airflow, or equivalent distributed processing frameworks
- Python for data processing and pipeline orchestration
- AWS experience
- Understanding of ML training data requirements
Education/Certifications:
- Bachelor's degree in Computer Science, Engineering, or a related field
Desired Skills (Not Required):
- Experience with geospatial data (PostGIS, H3, spatial joins)
- Maritime, defense, or intelligence domain experience
- Experience with data labeling infrastructure or weak supervision
- Familiarity with real-time streaming data systems
***El Comeback is a non-profit program from ConPRmetidos that attracts and retains professional talent for Puerto Rico-based jobs. Register at elcomebackpr.org/registration-form to get matched with professional opportunities on the island.***