Wo diese Rolle verfügbar ist
Standardmäßig eingeklappt, damit die Stellenbeschreibung gut lesbar bleibt.
- New York, Vereinigte Staaten
- New York City, Vereinigte Staaten
Rollenübersicht von JobGrid
Senior Data Engineer at Mecka.Ai: New York, Vereinigte Staaten, New York City, Vereinigte Staaten; Vor Ort; IT; Data Engineer. JobGrid adds normalized role facts, source context, and a path to the employer application page so candidates can compare the listing before applying.
- Location and workplace: New York, Vereinigte Staaten, New York City, Vereinigte Staaten, Vor Ort
- Role classification: IT, Data Engineer
- Source freshness: checked by JobGrid on 2026-05-29.
- Application path: candidates continue to the employer application page with non-personal referral tags.
About Mecka AI
Mecka AI is building the data infrastructure layer for robotics and embodied AI.
We design and operate global systems for data capture, data labeling, and hardware-enabled workflows used by leading AI and robotics teams. A meaningful portion of our data includes video, sensor streams, and large unstructured datasets used to train and evaluate modern vision-language and robotics models.
Making this data reliable, structured, and usable at scale is core to how Mecka operates.
The Role
We're hiring a Senior Data Engineer to own and evolve Mecka's data infrastructure, pipelines, and internal analytics systems, with a focus on large-scale, unstructured data.
This is a high-ownership role. You'll build and maintain pipelines that handle everything from operational metrics to video and multimodal datasets, helping organize, clean, and prepare data for downstream use in analytics, operations, and model training.
You'll work closely with operations, engineering, research, and founders to ensure the company can trust and move quickly with its data.
What You'll Work On
Data Pipelines & Infrastructure
Design, build, and maintain production-grade data pipelines ingesting structured and unstructured data
Own pipeline reliability, backfills, schema evolution, and performance
Build pipelines that handle large volumes of unstructured data
Support data preparation workflows for vision-language and robotics models
Data Modeling, Organization & Querying
Design data models that support analytics, operations, and research
Organize and clean large datasets, including video and sensor data
Write and optimize complex SQL queries over large datasets
Establish clear metric definitions and data contracts
Dashboards & Internal Tools
Build dashboards used by operations, research, and leadership
Develop internal tools and reports to inspect data health, quality, and throughput
Make complex data understandable to non-technical users
Data Quality & Observability
Define and implement data quality checks across pipelines
Monitor data freshness, correctness, and completeness
Investigate and resolve data issues across systems
Improve trust in data across the organization
Cross-Functional Collaboration & Leadership
Partner closely with operations, engineering, research, and founders
Translate ambiguous requirements into reliable data systems
Set standards for data engineering practices as the company scales
Mentor and support other engineers as the team grows
Who You Are
Required Experience
6+ years of experience in data engineering or analytics engineering
Proven experience owning production data pipelines end-to-end
Advanced SQL skills and experience working with large datasets
Experience building dashboards and internal analytics tools
Comfortable operating as a senior individual contributor with broad ownership
Strong Signals
Experience working with video data, sensor data, or multimodal datasets
Experience organizing, cleaning, and pipelining data for ML training or labeling
Familiarity with data preparation workflows for ML training or evaluation
Experience with modern data warehouses (BigQuery, Snowflake, Redshift)
Experience in startups or fast-scaling environments
You Likely Have
Worked with messy, real-world, high-volume data
Built pipelines that balance flexibility and correctness
Debugged broken pipelines and late-arriving data
Strong intuition for what data quality actually means in practice
Experience enabling non-technical teams with data
Why This Role
Own the data backbone of Mecka's operations and research workflows
Work with large-scale video and multimodal data used in modern AI systems
High autonomy and trust from day one
Direct impact on how data is delivered, analyzed, and used downstream