Mecka.Ai

Senior Data Engineer

🇺🇸 New York, Vereinigte Staaten, New York City, Vereinigte Staaten Vor Ort IT Veröffentlicht Mai 19, 2026

Bewerben

Arbeitsort Vor Ort

Kategorie IT

IT-Kategorie Data Engineer

Sprache English

Veröffentlicht 19. Mai 2026

Zuletzt geprüft 29. Mai 2026

Wo diese Rolle verfügbar ist

Standardmäßig eingeklappt, damit die Stellenbeschreibung gut lesbar bleibt.

2 Standorte

Vereinigte Staaten

New York, Vereinigte Staaten
New York City, Vereinigte Staaten

JobGrid-Kontext

Rollenübersicht von JobGrid

Senior Data Engineer at Mecka.Ai: New York, Vereinigte Staaten, New York City, Vereinigte Staaten; Vor Ort; IT; Data Engineer. JobGrid adds normalized role facts, source context, and a path to the employer application page so candidates can compare the listing before applying.

Location and workplace: New York, Vereinigte Staaten, New York City, Vereinigte Staaten, Vor Ort
Role classification: IT, Data Engineer
Source freshness: checked by JobGrid on 2026-05-29.
Application path: candidates continue to the employer application page with non-personal referral tags.

About Mecka AI

Mecka AI is building the data infrastructure layer for robotics and embodied AI.

We design and operate global systems for data capture, data labeling, and hardware-enabled workflows used by leading AI and robotics teams. A meaningful portion of our data includes video, sensor streams, and large unstructured datasets used to train and evaluate modern vision-language and robotics models.

Making this data reliable, structured, and usable at scale is core to how Mecka operates.

The Role

We're hiring a Senior Data Engineer to own and evolve Mecka's data infrastructure, pipelines, and internal analytics systems, with a focus on large-scale, unstructured data.

This is a high-ownership role. You'll build and maintain pipelines that handle everything from operational metrics to video and multimodal datasets, helping organize, clean, and prepare data for downstream use in analytics, operations, and model training.

You'll work closely with operations, engineering, research, and founders to ensure the company can trust and move quickly with its data.

What You'll Work On

Data Pipelines & Infrastructure

Design, build, and maintain production-grade data pipelines ingesting structured and unstructured data
Own pipeline reliability, backfills, schema evolution, and performance
Build pipelines that handle large volumes of unstructured data
Support data preparation workflows for vision-language and robotics models

Data Modeling, Organization & Querying

Design data models that support analytics, operations, and research
Organize and clean large datasets, including video and sensor data
Write and optimize complex SQL queries over large datasets
Establish clear metric definitions and data contracts

Dashboards & Internal Tools

Build dashboards used by operations, research, and leadership
Develop internal tools and reports to inspect data health, quality, and throughput
Make complex data understandable to non-technical users

Data Quality & Observability

Define and implement data quality checks across pipelines
Monitor data freshness, correctness, and completeness
Investigate and resolve data issues across systems
Improve trust in data across the organization

Cross-Functional Collaboration & Leadership

Partner closely with operations, engineering, research, and founders
Translate ambiguous requirements into reliable data systems
Set standards for data engineering practices as the company scales
Mentor and support other engineers as the team grows

Who You Are

Required Experience

6+ years of experience in data engineering or analytics engineering
Proven experience owning production data pipelines end-to-end
Advanced SQL skills and experience working with large datasets
Experience building dashboards and internal analytics tools
Comfortable operating as a senior individual contributor with broad ownership

Strong Signals

Experience working with video data, sensor data, or multimodal datasets
Experience organizing, cleaning, and pipelining data for ML training or labeling
Familiarity with data preparation workflows for ML training or evaluation
Experience with modern data warehouses (BigQuery, Snowflake, Redshift)
Experience in startups or fast-scaling environments

You Likely Have

Worked with messy, real-world, high-volume data
Built pipelines that balance flexibility and correctness
Debugged broken pipelines and late-arriving data
Strong intuition for what data quality actually means in practice
Experience enabling non-technical teams with data

Why This Role

Own the data backbone of Mecka's operations and research workflows
Work with large-scale video and multimodal data used in modern AI systems
High autonomy and trust from day one
Direct impact on how data is delivered, analyzed, and used downstream