Back to Jobs
Operations Engineer
Crossing Hurdles · · Full-time
Operations Easy Apply
Apply Now
Posted 3 weeks ago · Job #38
About the Role

About This Opportunity

The field of agentic AI — where systems autonomously plan, reason across multiple steps, delegate sub-tasks, and coordinate resources to achieve complex goals — is one of the most rapidly advancing areas in global technology. Building robust evaluation frameworks and benchmark tasks for these systems is foundational work that will shape how the entire industry assesses AI capability, safety, and reliability for years to come.

This Operations Engineer role sits at the frontier of applied AI research. You will design and build the operational scenarios — logistics problems, incident response simulations, capacity planning challenges, project management tasks — that multi-agent AI systems are evaluated against. This requires both strong Python engineering skills and genuine domain expertise in operational problem-solving. You need to understand how real operational problems are structured, what makes them hard, and how to encode that difficulty into a form that AI evaluation can measure precisely.

Opportunities to work directly on multi-agent AI evaluation at this level are rare and represent significant career capital in one of technology's most consequential emerging fields.


Role Responsibilities

Design and develop multi-agent benchmark tasks involving planning, scheduling, and resource allocationCreate real-world operational scenarios (logistics, project management, incident response, capacity planning)

Build constraint-rich problem statements with multiple dependencies and variables

Develop Python-based scripts to evaluate feasibility, completeness, and optimality

Break down complex problems into structured sub-tasks for multi-agent systems

Model scenarios with timelines, dependencies, and resource constraints

Collaborate with teams to improve task quality, coverage, and evaluation rigor


Applying for This Role

  • Portfolio over credentials: A GitHub repository with well-designed benchmark tasks, constraint modelling scripts, or AI evaluation frameworks will carry more weight than a CV alone. Build it before you apply.
  • Domain complexity is what matters: The role requires operational domain knowledge — demonstrate understanding of scheduling algorithms, resource allocation problems, dependency networks, and constraint satisfaction. Reference specific operational domains where you have experience.
  • Python engineering depth is required: This is not a research role with occasional scripting. You need to build robust, well-documented evaluation infrastructure. Demonstrate software engineering discipline in your application.
  • Collaborative mindset: The role explicitly involves working with teams to improve task quality and evaluation rigour — show that you engage constructively with technical peers and incorporate feedback.
Requirements

Requirements

5+ years of experience in operations, project management, logistics, or supply chain

Strong understanding of constraints, dependencies, and scheduling logic

Proficiency in Python for validation and verification scripting

Strong structured problem-solving and decomposition skills

Ability to model real-world operational scenarios

Clear technical communication and documentation skills

Ability to work in a fast-paced environment and meet deadlines

Preferred Qualifications

Experience with optimization techniques (linear programming, constraint satisfaction, scheduling algorithms)

Background in operations research

Experience with simulation or modeling tools

Familiarity with AI planning systems or automated reasoning

Experience with AI benchmarks (e.g., SWE-bench, Terminal-bench)

Hands-on experience with Docker

Application Process

Apply via Easy Apply / shared link and complete the Interest Check Form (ICF)

Complete the take-home assessment (post-shortlisting)

Shortlisted candidates will be reviewed further

The team will connect with next steps

Benefits

Compensation: $15/hour

About Crossing Hurdles
Crossing Hurdles
Social Enterprise

Crossing Hurdles is a social enterprise focused on empowering individuals and communities through skills training, mentorship, and capacity-building programmes. The organisation partners with NGOs, public institutions, and private sector actors to help people overcome barriers to employment, education, and economic participation.

🧭
Application Guide for This Role
Tailored tips to help you stand out and prepare confidently
🔧 What Operations Hiring Managers Look For

Operations leaders hire for process thinking, cross-functional coordination, and measurable efficiency improvement. They want candidates who can diagnose bottlenecks, implement fixes, and sustain improvements without constant oversight — and who communicate trade-offs clearly before committing resources.

How to Stand Out
  • Lead with process improvement impact: throughput increased, error rate reduced, cost per unit lowered — numbers make the story concrete.
  • Show experience with ops tooling: project management software, ERP systems, BI dashboards, or workflow automation platforms.
  • Demonstrate cross-functional credibility: describe how you've worked alongside finance, sales, or engineering to implement an operational change.
  • Prepare a structured problem-solving example using a framework (5 Whys, DMAIC, A3) — ops interviews love process thinking made explicit.
Likely Interview Questions
  1. Walk me through a process you found and fixed — from identification to sustained improvement.
  2. How do you get buy-in from teams who are resistant to a process change?
  3. Describe how you'd prioritise three equally urgent operational problems with one team.
  4. How do you measure whether an operational improvement has actually stuck after 90 days?
Pro tip: Review Lean Six Sigma concepts (even at Yellow Belt level) before your interview — being able to name the methodology behind your process improvements adds professional credibility.
📄 About Full-Time Employment Roles

Full-time roles typically include benefits (health insurance, pension contributions, paid leave). During salary negotiation, always consider the total compensation package — benefits can be worth 20–30% on top of base salary. Ask specifically about probation period, performance review cadence, and remote/hybrid flexibility before signing.

✅ Before You Hit Submit
📝
Tailor your CV
Remove irrelevant roles. Match your language to the job description — ATS systems score keyword alignment.
💌
Write a real cover note
One paragraph that explains why this specific company, this specific role, right now. Generic notes go unread.
🔍
Research the company
Know their product, recent news, funding stage, and competitors. Bring one insight to your interview.
🔗
Clean up your LinkedIn
Make sure your profile matches your CV and your headline reflects the role you want, not the one you are leaving.
Job Overview
Salary Competitive
Type Full-time
Location
Category Operations
Posted Apr 27, 2026
Apply Now
Free Daily Digest
Stay ahead of the job market

New jobs, scholarships and career tips — delivered to your inbox daily. Unsubscribe any time.