Mecka.Ai

Senior Data Engineer

🇺🇸 New York, Vereinigte Staaten, New York City, Vereinigte Staaten Vor Ort IT Veröffentlicht Mai 19, 2026
Arbeitsort Vor Ort
Kategorie IT
IT-Kategorie Data Engineer
Sprache English
Veröffentlicht 19. Mai 2026
Zuletzt geprüft 29. Mai 2026

Wo diese Rolle verfügbar ist

Standardmäßig eingeklappt, damit die Stellenbeschreibung gut lesbar bleibt.

2 Standorte
Vereinigte Staaten
  • New York, Vereinigte Staaten
  • New York City, Vereinigte Staaten
JobGrid-Kontext

Rollenübersicht von JobGrid

Senior Data Engineer at Mecka.Ai: New York, Vereinigte Staaten, New York City, Vereinigte Staaten; Vor Ort; IT; Data Engineer. JobGrid adds normalized role facts, source context, and a path to the employer application page so candidates can compare the listing before applying.

  • Location and workplace: New York, Vereinigte Staaten, New York City, Vereinigte Staaten, Vor Ort
  • Role classification: IT, Data Engineer
  • Source freshness: checked by JobGrid on 2026-05-29.
  • Application path: candidates continue to the employer application page with non-personal referral tags.

About Mecka AI

Mecka AI is building the data infrastructure layer for robotics and embodied AI.

We design and operate global systems for data capture, data labeling, and hardware-enabled workflows used by leading AI and robotics teams. A meaningful portion of our data includes video, sensor streams, and large unstructured datasets used to train and evaluate modern vision-language and robotics models.

Making this data reliable, structured, and usable at scale is core to how Mecka operates.

The Role

We're hiring a Senior Data Engineer to own and evolve Mecka's data infrastructure, pipelines, and internal analytics systems, with a focus on large-scale, unstructured data.

This is a high-ownership role. You'll build and maintain pipelines that handle everything from operational metrics to video and multimodal datasets, helping organize, clean, and prepare data for downstream use in analytics, operations, and model training.

You'll work closely with operations, engineering, research, and founders to ensure the company can trust and move quickly with its data.

What You'll Work On

Data Pipelines & Infrastructure

  • Design, build, and maintain production-grade data pipelines ingesting structured and unstructured data

  • Own pipeline reliability, backfills, schema evolution, and performance

  • Build pipelines that handle large volumes of unstructured data

  • Support data preparation workflows for vision-language and robotics models

Data Modeling, Organization & Querying

  • Design data models that support analytics, operations, and research

  • Organize and clean large datasets, including video and sensor data

  • Write and optimize complex SQL queries over large datasets

  • Establish clear metric definitions and data contracts

Dashboards & Internal Tools

  • Build dashboards used by operations, research, and leadership

  • Develop internal tools and reports to inspect data health, quality, and throughput

  • Make complex data understandable to non-technical users

Data Quality & Observability

  • Define and implement data quality checks across pipelines

  • Monitor data freshness, correctness, and completeness

  • Investigate and resolve data issues across systems

  • Improve trust in data across the organization

Cross-Functional Collaboration & Leadership

  • Partner closely with operations, engineering, research, and founders

  • Translate ambiguous requirements into reliable data systems

  • Set standards for data engineering practices as the company scales

  • Mentor and support other engineers as the team grows

Who You Are

Required Experience

  • 6+ years of experience in data engineering or analytics engineering

  • Proven experience owning production data pipelines end-to-end

  • Advanced SQL skills and experience working with large datasets

  • Experience building dashboards and internal analytics tools

  • Comfortable operating as a senior individual contributor with broad ownership

Strong Signals

  • Experience working with video data, sensor data, or multimodal datasets

  • Experience organizing, cleaning, and pipelining data for ML training or labeling

  • Familiarity with data preparation workflows for ML training or evaluation

  • Experience with modern data warehouses (BigQuery, Snowflake, Redshift)

  • Experience in startups or fast-scaling environments

You Likely Have

  • Worked with messy, real-world, high-volume data

  • Built pipelines that balance flexibility and correctness

  • Debugged broken pipelines and late-arriving data

  • Strong intuition for what data quality actually means in practice

  • Experience enabling non-technical teams with data

Why This Role

  • Own the data backbone of Mecka's operations and research workflows

  • Work with large-scale video and multimodal data used in modern AI systems

  • High autonomy and trust from day one

  • Direct impact on how data is delivered, analyzed, and used downstream