AI / Systems Engineer

Own the desktop capture, trace generation, and evaluation evidence infrastructure behind NeoHuman. You will work at the boundary between operating systems, desktop apps, multimodal data capture, LLM-based normalization, and assessment pipelines. The core challenge: building a trustworthy system that reconstructs a candidate's real work session across macOS and Windows without relying on a browser extension.

Responsibilities

  • - Build cross-platform desktop session capture for macOS and Windows: app activity, accessibility trees, OCR snapshots, clipboard events, keyboard/mouse, file interactions
  • - Turn noisy low-level signals into clean human-readable session timelines
  • - Design data model and ingestion pipeline for AI competency evaluation evidence
  • - Build robust fallback logic across AX/UIA, OCR, keyboard triggers, and video segments
  • - Use LLMs carefully to summarise and structure session traces without hallucinating
  • - Own Electron packaging, native helpers, permission flows, auto-updates, and crash handling
  • - Maintain PostgreSQL/Supabase schemas, storage flows, async jobs, and replay assets

Benefits & Perks

- Fully remote (global) - High ownership — own major parts of capture and evaluation stack directly - Research + product: publish benchmarks, ship production software, build AI eval data infrastructure - Technical founders who code daily - Hard, unusual problem at the frontier of AI assessment

Requirements

  • - Strong production engineering experience with TypeScript and Node.js
  • - Experience building desktop or system-level software (Electron + native helpers ideal)
  • - Strong PostgreSQL — schema design, migrations, indexing, data quality constraints
  • - Comfortable with OS APIs, permissions, background processes, cross-platform differences
  • - Hands-on LLM API experience: structured output, summarisation, hallucination control
  • - Security and privacy mindset around sensitive user activity, PII, and audit requirements

About Emergences Labs

Emergences Labs builds NeoHuman — a desktop-first challenge and proctoring system that observes how people use AI tools, software, and workflows during real tasks, turning sessions into reliable AI competency evaluation evidence. Published the AgentIF-OneDay benchmark. Small, research-driven team that values demonstrated capability over credentials.

Company Size

1–10 employees employees

Industry

AI Infrastructure / Developer Tools

Ready to apply?

Take the next step in your career and apply for this position today.

Apply for this Position