Retrieval that works
Hybrid retrieval, reranking, chunking strategy. You picked an approach because the numbers said so — not because it was the default.
AI integration, RAG and retrieval, agent orchestration, fine-tuning, and the eval harness that keeps it honest. Verified by build, not buzzwords.
Hiring for: AI Engineer, ML Engineer, RAG Engineer, Applied Scientist.
18-question adaptive quiz across LLM fundamentals, RAG, agents, evals, tool use, prompt caching, and MCP — plus a graded RAG-pipeline project. ~12h total.
The microskills the rubric tests against.
AI Engineering roles in the SignalAI marketplace.
A verified score is good for 18 months before re-certification.
What you'll prove
Retrieval that works
Hybrid retrieval, reranking, chunking strategy. You picked an approach because the numbers said so — not because it was the default.
Agents and orchestration
Tool use, multi-step plans, recovery from bad calls. You can name the failure modes and show how your system survives them.
Evals as a first-class artifact
An eval set sourced from real queries, scored on the metrics that matter, with before/after numbers when you change the system.
Fine-tuning and adaptation
When to fine-tune vs prompt vs retrieve. You've actually shipped one of each and can defend the call.
Sample project brief
Pick a corpus and ship a working RAG pipeline (BM25 + dense + reranker) with an eval harness that lets a reviewer reproduce your numbers.
Deliverables
How grading works
Each criterion is scored 0–5 with a written rationale. Your score is the weighted sum, published with the rubric so an employer can see exactly what you did.
Build quality
Repo runs end-to-end. Pipeline is wired correctly. Ingestion is idempotent.
End-to-end evals
Real query set, real metrics, before/after numbers, one documented ablation.
Code clarity
Right-sized modules. Secrets via env. Eval harness sits next to the pipeline.
Build narrative
Names two real tradeoffs and resolves them with numbers.
Live Engineering jobs
Commercial Counsel, Infrastructure Security
New York City, NY; San Francisco, CA; Seattle, WA; Washington, DC
Staff Infrastructure Engineer, Cluster Infrastructure
San Francisco, CA | New York City, NY | Seattle, WA
Data Scientist, Product
San Francisco, CA | New York City, NY
Data Scientist, Supply
San Francisco, CA | New York City, NY
Senior Business Systems Analyst, Finance Systems
San Francisco, CA | Seattle, WA
Staff Software Engineer, Cloud Inference Safeguards
San Francisco, CA | Seattle, WA
What a verified profile looks like
See an example scorecard with the composite score, rubric breakdown, project artifact links, and quiz top-microskills. Yours will look exactly like this.
See a sample profileGet verified
The AIB Foundations assessment grades all three rubrics — including Engineering — on the same bar. Standalone Engineering Foundations ships in cohort 02.