Gramian Consulting Group

AI Evaluator - POLISH

🇵🇱 Zdalnie, Polska Zdalnie IT Kontrakt Średni poziom Opublikowano Cze 1, 2026
Lokalizacja Zdalnie, Polska
Tryb pracy Zdalnie
Forma zatrudnienia Kontrakt
Poziom doświadczenia Średni poziom
Kategoria IT
Kategoria IT Analityka
Język English
Opublikowano 1 czerwca 2026
Ostatnio sprawdzono 1 czerwca 2026
Kontekst JobGrid

Podsumowanie roli od JobGrid

AI Evaluator - POLISH at Gramian Consulting Group is a remote contract role in Poland for mid-level candidates in IT / Analytics. JobGrid normalizes the role facts, keeps the employer description separate, and sends candidates to the original public application page with non-personal referral parameters. This listing is part of JobGrid's Zdalne oferty pracy AI z publicznych stron firm.

  • Remote, Poland; workplace: Remote; employment type: Contract; seniority: Mid; category: IT / Analytics.
  • Source content is in English, and JobGrid preserves that original-language boundary while presenting the listing in English.
  • Source freshness is current: posted on 2026-06-01 and last checked on 2026-06-01.
  • No salary was provided in the payload, so JobGrid does not add salary context or estimates here.

About Us

Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.

This opening is on behalf of one of our clients, and we’ll work closely with you to make the process clear and straightforward.

Role Overview

We are looking for Polish-speaking AI Content Analysts to support evaluation of a new personalization capability within a leading AI assistant platform.

In this role, you will design realistic conversational prompts based on your own context and experiences, then rigorously evaluate how effectively the AI uses personal signals (such as prior conversations or activity context) to produce relevant, grounded, and helpful responses.

This position combines creative prompt design, analytical evaluation, and structured quality assessment, making it ideal for candidates with experience in AI evaluation, annotation, content review, or analytical research roles.

Responsibilities

  • Design and run short multi-turn conversations (typically 1–5 turns) intended to test AI personalization behavior
  • Create prompts grounded in realistic personal scenarios to evaluate contextual understanding
  • Review AI responses to determine whether personalization is correctly applied
  • Check grounding quality to ensure the model does not invent unsupported claims about the user
  • Evaluate integration quality — confirming personal signals are used naturally (not forced or robotic)
  • Compare two responses side-by-side and determine which is more helpful, natural, and relevant
  • Write clear, structured rationales explaining rankings and referencing specific conversation turns
  • Verify debug information showing whether correct data sources were used
  • Maintain strict workflow hygiene (including deleting evaluation conversations when required)

Notes:

  • The role is 100% remote, working hours within your local time zone (this is a MUST)
  • 30-40 hours/week commitment. Paid by hours logged and approved.
  • Contracting Model
  • Duration: 1 month (possible extension)