All research
arXiv Preprint

Mira-Embeddings-V1: Domain-Adapted Semantic Reranking for Recruitment via LLM-Synthesized Data

Zhaohua Liang, Zhilin Wang, Renjie Cao, Yining Zhang April 2026 OpenJobs AI · San Francisco
Abstract

Candidate sourcing for recruiters is best viewed as a two-stage retrieval and reranking pipeline with recall as the primary objective under a limited review budget. In the primary setting studied in this paper, an upstream production retriever first returns a candidate shortlist for each job description (JD), and our goal is to rerank that shortlist so that qualified candidates appear as high as possible. We present mira-embeddings-v1, a semantic reranking system for the recruitment domain that reshapes the embedding space with large language model (LLM)-synthesized training data and further corrects boundary confusions with a lightweight reranking head.

Starting from real JDs, we build a five-stage prompt pipeline to generate diverse positive and hard-negative samples that sculpt the model's semantic space from multiple angles. We then apply a two-round LoRA adaptation: JD–JD contrastive training followed by JD–CV triplet alignment on a heterogeneous text JD–CV dataset. Importantly, these gains are pursued without large-scale manually labeled industrial training pairs: a modest set of real JDs is expanded into supervision at the task level through LLM synthesis. Finally, a BoundaryHead multi-layer perceptron (MLP) reranks the Top-K results to further distinguish between roles that share the same title but differ in scope.

On a local pool of 300 real JDs with candidates returned by an upstream production retriever, mira-embeddings-v1 improves Recall@50 from 68.89% (baseline) to 77.55% while lifting Precision@10 from 35.77% to 39.62%. On a supportive global pool protocol over 44,138 candidates judged by a Qwen3-32B rubric, it achieves Recall@200 of 0.7047 versus 0.5969 for the baseline. These results show that supervision synthesized by LLMs with data efficiency, plus reranking with boundary awareness, yields robust gains without introducing a heavy cross-encoder in our current pipeline.

68.89% → 77.55%
Recall@50 on 300 real JDs
35.77% → 39.62%
Precision@10 on the same local pool
0.5969 → 0.7047
Recall@200 over 44,138 candidates (Qwen3-32B rubric)
Method

LLM-synthesized supervision, then boundary-aware reranking.

  • Five-stage prompt pipeline — generates diverse positive and hard-negative samples from real JDs to sculpt the semantic space.
  • Two-round LoRA adaptation — JD–JD contrastive training followed by JD–CV triplet alignment on a heterogeneous JD–CV dataset.
  • BoundaryHead MLP — reranks the Top-K to separate roles that share a title but differ in scope, seniority, or function.
recruitment matchingsemantic rerankingLLM-synthesized databoundary-aware reranking