Semantic Search API · Vector + Neural Search

Semantic search API for jobs, built on direct-from-source data

Vector search, neural search, and natural language queries one API, one embedding model, 4M+ live listings sourced directly from company career pages.

768d Embedding dimensions
<50ms
4M+ job listings
300k+ comapnies indexed
POST/v2/jobs/vsearch
200 OK · 42ms

QUERY (NATURAL LANGUAGE)

"senior backend engineer building distributed payment systems with Go and Kafka, remote-friendly EU"

SOCRATES V2 EMBEDDING · 768 DIMENSIONS

[-0.0294, 0.0613, -0.0413, 0.0187, -0.0822, 0.0154, ...762 more]

Senior Payment Platform Engineer

Klarna · Stockholm · Remote EU

0.92

Staff Engineer, Money Movement

Wise · London · Hybrid

0.88

Senior Software Engineer, Payments Infrastructure

Adyen · Amsterdam · Hybrid

0.84
Indexed: Product Designer @ Airbnb
Indexed: Staff Eng @ Linear
Indexed: Head of Sales @ Ramp
Indexed: Backend Dev @ Stripe
Indexed: AI Researcher @ OpenAI
Indexed: Senior PM @ Notion
Indexed: DevOps Lead @ Datadog
Indexed: ML Engineer @ Anthropic
The Endpoints

Three semantic search endpoints, one embedding model.

Pure similarity, hybrid with filters, or raw vector input — all backed by the same socrates v2 index.

Solution Illustration
Vector Search
POST /v2/jobs/search

Pure semantic similarity.

Pass a natural language query, a job ID, or your own pre-computed vectors. Get the top-K most similar listings ranked by cosine similarity.

dim_1dim_2query0.970.92EngineeringDesignData / MLtop-K results
  • NL search bars
  • similar-jobs recommendations
  • discovery feeds
  • AI agents
Neural Search
POST /v2/jobs/search

Semantic + filters, in one request.

Send a natural language query under vector and structured filters under lexical — location, salary, experience, industry, posting date. Combined ranking, single round trip.

query"ML Engineer"filterslocation: NYCsocrates-v2+ lexical filtersranked0.98ML Eng · NYC0.95AI Eng · NYC0.91DS Eng · NYC
  • production search backends
  • narrow semantic queries
  • filtered AI agents
Raw Vector Input
POST /v2/jobs/search

Bring your own embeddings.

Pass raw 768-dim vectors directly via the vectors parameter. Skip re-encoding when you've cached embeddings in your own pipeline.

vectors[0] — 768-dimd00.34d1-0.89d20.13d30.60d4-0.45d50.72d6-0.23d70.56positivenegative... 760 more dimensions✓ skip re-encodingcached pipeline
  • batch matching
  • cached embeddings
  • custom recommender stacks
The Endpoints

Many product features, one API

What the Semantic Search API actually powers in production.

Natural language search

A search bar that understands what users mean.

Let users type "remote backend role at a Series B fintech with equity" instead of clicking through filters. The API matches role, seniority, company stage, geography, and compensation as a single query.

Implementation: POST /v2/jobs/vsearch with search_type: "summary" and a free-form query string.
Similar jobs / recommendations

A "Similar jobs" rail that actually works.

Pass a job ID, get the top-K most similar listings. ML engineer roles cluster with Applied Researcher; data scientist clusters with quant analyst. Powers detail-page rails and "you might also like" feeds.

Implementation: POST /v2/jobs/vsearch with search_type: "job" and a job_id. Returns scores 0.0–1.0.
AI job agents & copilots

The data layer behind AI career agents.

Feed user intent — natural language goals, conversation context, scraped requirements — directly to the API. Match user descriptions to live listings without prompt engineering against keywords.

Implementation: POST /v2/jobs/vsearch or POST /v2/jobs/neural-search depending on whether you need filters.
Filtered semantic search

Semantic queries that respect real constraints.

"ML engineer" should match Applied Researcher AND filter to remote, $150k+, senior, posted in the last 30 days. Neural Search runs both in one request with combined ranking.

Implementation: POST /v2/jobs/neural-search with both vector and lexical blocks. The recommended endpoint when filters matter.
Custom embedding pipelines

Bring your own vectors. Reuse, batch, recommend.

Cache vectors once, query them many times. Match cached user-intent vectors against fresh job postings every hour without re-encoding. Powers batch matching, scheduled job alerts, and custom recommender stacks.

Implementation: POST /v2/jobs/neural-search with vector.vectors: [[768 floats]]. Pair with the Job Export API to mirror the index in your warehouse.
Try it live

Five-line natural language search.

Page 1 of every search endpoint returns results without an API key. Prototype before you sign up.

diamond

JSON in, JSON out.

Works with any HTTP client. Official Python SDK — pip install hirebase.

diamond

Score thresholding built in.

Every result returns a similarity score 0.0 - 1.0. Pass score to drop low-quality matches.

Background Noise
Neural Search · semantic + filters
Request
# Filtered semantic search in one request
curl -X POST https://api.hirebase.org/v2/jobs/neural-search \
-H "Content-Type: application/json" \
-H "x-api-key: YOUR_API_KEY" \
-d '{
"vector": {
"query": "senior backend engineer building
distributed systems with Go and Kafka"
},
"lexical": {
"location_types": ["Remote", "Hybrid"],
"experience": ["Senior"],
"industry": "Tech, Software & IT Services",
"days_ago": 30,
"limit": 10
}
}'
Response payload
200 OK
# Returns ranked results with similarity scores
{
"jobs": [
{
"_id": "6814bw99fc2284gt4777f21a",
"job_title": "Senior Payment Platform Engineer",
"company_name": "Klarna",
"location_type": "Remote",
"experience_level": "Senior",
"vector_score": 0.92,
"date_posted": "2026-04-15"
}, ...
],
"total_count": 847,
"total_pages": 85
}
Under The Hood

A semantic search API tuned specifically
for jobs.

Domain-tuned embeddings preserve role, seniority, function, and skill-level distinctions that general-purpose models flatten.

[ 01 ] Embedding model

socrates v2 — 768d, job-domain tuned.

Proprietary 768-dimensional model trained on tens of millions of job postings. Model name and version returned on every embedding response so you can pin to a specific release.

[ 02 ] Similarity scoring

Cosine similarity · 0.0–1.0 floats.

Vector Search returns `score`; Neural Search returns `vector_score`. Threshold low-quality results with the optional `score` request parameter.

[ 03 ] Three input modes

Query, job_id, or raw vectors.

Pass natural language (encoded server-side), the Mongo ObjectId of a reference job (uses the precomputed embedding), or your own raw 768-dimensional vector. Same endpoint, three workflows.

[ 04 ] Accuracy controls

Tune recall vs latency.

The `accuracy` parameter (`low` / `medium` / `high`) trades exhaustiveness for response time. Default is `medium`.

[ 05 ] Result count

top_k up to 500.

Use `top_k` to control candidate set size before pagination — useful for re-ranking with your own model. Pairs with `limit`, `page`, or `offset`.

[ 06 ] Free first page

Page 1 without an API key.

Both Vector Search and Neural Search return page 1 without authentication. Prototype before signing up. Page 2+ requires a free key.

Prompting Guid

How to write queries that
actually rank well.

Richer prompts give the embedding model more dimensions to match against. A well-written query consistently outperforms structured filters alone.

[ Tip 01 ]

Embed location, level, and stack directly into the prompt.

Don't relegate context to filter parameters when the embedding model can match it from the query itself.

Better:"Senior SWE based in Santa Clara, California, building Kubernetes-native infra"
[ Tip 02 ]

Describe the kind of company alongside the role.

Stage, size, and industry character match through the company entity linkage on every job.

Better:"Senior Director of Supply Chain in Pharmaceuticals, small manufacturing company with global ambition"
[ Tip 03 ]

Spell out technical background and what they want next.

For agent and copilot workflows, give the model both the candidate's current state and their target.

Better:"5+ years quantitative analysis at financial firms, expertise in PyTorch, ready to lead and mentor juniors in NYC"
[ Tip 04 ]

Use Neural Search when filters matter.

Vector Search accepts lexical parameters but does not currently honor them. For 30-day windows, salary floors, industry, or location filtering to actually narrow results, use Neural Search with both vector and lexical blocks.

Recommended:Production search backends · scheduled agents · filtered AI workflows
vs. Alternatives

Why teams choose Hirebase over generic
search APIs.

vs. building it yourself with general-purpose embeddings and a vector DB, or buying keyword-only job posting feeds.

Keyword-Only Job Posting APIs
Aggregator-derived feeds
DIY Embedding Stack
Generic embeddings + vector DB
Hirebase
Hirebase Semantic Search
Job-tuned, hosted, single API
01
Embedding model
None — keyword matching only
General-purpose (text-embedding-3, etc.)
socrates v2 — tuned on millions of job postings
02
Job data source
Mostly aggregator feeds, recycled
Bring your own — you build the crawler
Direct from 300k+ company career pages and 80+ ATS platforms
03
Setup time
API call
Weeks — crawl, encode, embed, index, host
Five minutes — page 1 free, no signup
04
Hybrid filtering
Filters work; no semantic layer
You implement re-ranking yourself
Neural Search: vector + lexical in one request
05
Domain fidelity
Term-level matching only
"ML engineer" matches "ML engineer" — flattens role distinctions
Preserves role, seniority, function, skill relationships
06
Index freshness
24–72 hour lag typical
You maintain it — re-encoding cost grows with scale
Most postings indexed in <1hr · embeddings refreshed automatically
07
Latency
Varies; usually fine
Embedding round-trip + vector DB query
Sub-50ms median — single hop
08
Pricing posture
Cheap, often unusable
Embedding fees + vector DB fees + crawler infra
Free tier · usage-based · no per-token costs
Common Questions

Frequently asked questions about the
Semantic Search API.

Get Started

Search by meaning, not keywords.

Page 1 is free. Run a real semantic search in your terminal in 60 seconds.

Share feedback - DM or email [email protected]
© 2026 Hirebase. All rights reserved.