Role overview
This role focuses on strengthening advanced AI systems through rigorous technical review and evaluation. As a Software Engineering Expert, you will assess AI-generated code, validate algorithms, refine prompts, and contribute to structured benchmarking efforts used by leading AI research teams.
The work is analytical and detail-oriented. You will operate at the intersection of software engineering and AI model evaluation, applying real-world engineering standards to assess correctness, clarity, robustness, and technical depth.
This role is suited for experienced engineers who are comfortable reviewing unfamiliar code quickly, identifying subtle issues, and providing precise, structured feedback.
What you’ll actually be doing
- Reviewing AI-generated code for correctness, edge cases, performance, and maintainability
- Refining and improving technical prompts to produce clearer, more deterministic outputs
- Validating algorithms, data structures, and architectural decisions for accuracy
- Writing structured feedback on solution quality, clarity, and completeness
- Tagging and organizing content by topic, language, and complexity level
- Supporting benchmarking initiatives that measure model reasoning and coding capability
- Identifying technical flaws, logical gaps, or incorrect assumptions in generated outputs
Who this role is for
- Engineers with strong fundamentals in algorithms, data structures, and software design
- Developers comfortable reviewing code across multiple languages
- Professionals who can clearly articulate why something is technically correct or incorrect
- Engineers who enjoy analysis, validation, and technical critique
- Detail-oriented individuals who can follow evaluation rubrics precisely
Who this role is likely NOT for
- Junior developers without hands-on debugging and validation experience
- Engineers who primarily focus on shipping features without reviewing others’ code
- Candidates uncomfortable providing written technical feedback
- Developers who lack experience reading code outside their primary stack
- Those looking for purely product-building or feature-delivery roles
Technical background
- At least 2 years of experience in software engineering or closely related technical work
- Bachelor’s degree in Computer Science, Software Engineering, or a related technical discipline (advanced degree preferred)
- Strong proficiency in at least one major programming language such as Python, JavaScript, Java, or C++
- Solid understanding of debugging, testing strategies, and code validation practices
- Ability to evaluate algorithmic efficiency and correctness
- Strong written communication skills for structured technical feedback
Project scope
- Immediate start with potential extension based on performance and research requirements
- Project-based engagement focused on AI research and evaluation
- Initial duration of approximately 1–2 months
- Weekly time commitment typically ranges from 15 to 25 hours, with some flexibility depending on project needs
