Semantic Search API · Vector + Neural Search

Semantic search API for jobs, built on direct-from-source data

Vector search, neural search, and natural language queries one API, one embedding model, 4M+ live listings sourced directly from company career pages.

Get API Key Read Docs

768d Embedding dimensions

<50ms

4M+ job listings

300k+ comapnies indexed

POST/v2/jobs/vsearch

200 OK · 42ms

QUERY (NATURAL LANGUAGE)

"senior backend engineer building distributed payment systems with Go and Kafka, remote-friendly EU"

↓

SOCRATES V2 EMBEDDING · 768 DIMENSIONS

[-0.0294, 0.0613, -0.0413, 0.0187, -0.0822, 0.0154, ...762 more]

↓

Senior Payment Platform Engineer

Klarna · Stockholm · Remote EU

0.92

Staff Engineer, Money Movement

Wise · London · Hybrid

0.88

Senior Software Engineer, Payments Infrastructure

Adyen · Amsterdam · Hybrid

0.84

Indexed: Product Designer @ Airbnb

Indexed: Staff Eng @ Linear

Indexed: Head of Sales @ Ramp

Indexed: Backend Dev @ Stripe

Indexed: AI Researcher @ OpenAI

Indexed: Senior PM @ Notion

Indexed: DevOps Lead @ Datadog

Indexed: ML Engineer @ Anthropic

The Endpoints

Three semantic search endpoints, one embedding model.

Pure similarity, hybrid with filters, or raw vector input — all backed by the same socrates v2 index.

Vector SearchPOST /v2/jobs/search

Pure semantic similarity.

Pass a natural language query, a job ID, or your own pre-computed vectors. Get the top-K most similar listings ranked by cosine similarity.

NL search bars
similar-jobs recommendations
discovery feeds
AI agents

Neural SearchPOST /v2/jobs/search

Semantic + filters, in one request.

Send a natural language query under vector and structured filters under lexical — location, salary, experience, industry, posting date. Combined ranking, single round trip.

production search backends
narrow semantic queries
filtered AI agents

Raw Vector InputPOST /v2/jobs/search

Bring your own embeddings.

Pass raw 768-dim vectors directly via the vectors parameter. Skip re-encoding when you've cached embeddings in your own pipeline.

batch matching
cached embeddings
custom recommender stacks

The Endpoints

Many product features, one API

What the Semantic Search API actually powers in production.

Natural language search

A search bar that understands what users mean.

Let users type "remote backend role at a Series B fintech with equity" instead of clicking through filters. The API matches role, seniority, company stage, geography, and compensation as a single query.

Implementation: POST /v2/jobs/vsearch with search_type: "summary" and a free-form query string.

Similar jobs / recommendations

A "Similar jobs" rail that actually works.

Pass a job ID, get the top-K most similar listings. ML engineer roles cluster with Applied Researcher; data scientist clusters with quant analyst. Powers detail-page rails and "you might also like" feeds.

Implementation: POST /v2/jobs/vsearch with search_type: "job" and a job_id. Returns scores 0.0–1.0.

AI job agents & copilots

The data layer behind AI career agents.

Feed user intent — natural language goals, conversation context, scraped requirements — directly to the API. Match user descriptions to live listings without prompt engineering against keywords.

Implementation: POST /v2/jobs/vsearch or POST /v2/jobs/neural-search depending on whether you need filters.

Filtered semantic search

Semantic queries that respect real constraints.

"ML engineer" should match Applied Researcher AND filter to remote, $150k+, senior, posted in the last 30 days. Neural Search runs both in one request with combined ranking.

Implementation: POST /v2/jobs/neural-search with both vector and lexical blocks. The recommended endpoint when filters matter.

Custom embedding pipelines

Bring your own vectors. Reuse, batch, recommend.

Cache vectors once, query them many times. Match cached user-intent vectors against fresh job postings every hour without re-encoding. Powers batch matching, scheduled job alerts, and custom recommender stacks.

Implementation: POST /v2/jobs/neural-search with vector.vectors: [[768 floats]]. Pair with the Job Export API to mirror the index in your warehouse.

Try it live

Five-line natural language search.

Page 1 of every search endpoint returns results without an API key. Prototype before you sign up.

JSON in, JSON out.

Works with any HTTP client. Official Python SDK — pip install hirebase.

Score thresholding built in.

Every result returns a similarity score 0.0 - 1.0. Pass score to drop low-quality matches.

Neural Search · semantic + filters

Request

# Filtered semantic search in one request
curl -X POST https://api.hirebase.org/v2/jobs/neural-search \
-H "Content-Type: application/json" \
-H "x-api-key: YOUR_API_KEY" \
-d '{
"vector": {
"query": "senior backend engineer building
distributed systems with Go and Kafka"
},
"lexical": {
"location_types": ["Remote", "Hybrid"],
"experience": ["Senior"],
"industry": "Tech, Software & IT Services",
"days_ago": 30,
"limit": 10
}
}'

Response payload

200 OK

# Returns ranked results with similarity scores
{
"jobs": [
{
"_id": "6814bw99fc2284gt4777f21a",
"job_title": "Senior Payment Platform Engineer",
"company_name": "Klarna",
"location_type": "Remote",
"experience_level": "Senior",
"vector_score": 0.92,
"date_posted": "2026-04-15"
}, ...
],
"total_count": 847,
"total_pages": 85
}

Under The Hood

A semantic search API tuned specifically
for jobs.

Domain-tuned embeddings preserve role, seniority, function, and skill-level distinctions that general-purpose models flatten.

[ 01 ] Embedding model

socrates v2 — 768d, job-domain tuned.

Proprietary 768-dimensional model trained on tens of millions of job postings. Model name and version returned on every embedding response so you can pin to a specific release.

[ 02 ] Similarity scoring

Cosine similarity · 0.0–1.0 floats.

Vector Search returns `score`; Neural Search returns `vector_score`. Threshold low-quality results with the optional `score` request parameter.

[ 03 ] Three input modes

Query, job_id, or raw vectors.

Pass natural language (encoded server-side), the Mongo ObjectId of a reference job (uses the precomputed embedding), or your own raw 768-dimensional vector. Same endpoint, three workflows.

[ 04 ] Accuracy controls

Tune recall vs latency.

The `accuracy` parameter (`low` / `medium` / `high`) trades exhaustiveness for response time. Default is `medium`.

[ 05 ] Result count

top_k up to 500.

Use `top_k` to control candidate set size before pagination — useful for re-ranking with your own model. Pairs with `limit`, `page`, or `offset`.

[ 06 ] Free first page

Page 1 without an API key.

Both Vector Search and Neural Search return page 1 without authentication. Prototype before signing up. Page 2+ requires a free key.

Prompting Guid

How to write queries that
actually rank well.

Richer prompts give the embedding model more dimensions to match against. A well-written query consistently outperforms structured filters alone.

[ Tip 01 ]

Embed location, level, and stack directly into the prompt.

Don't relegate context to filter parameters when the embedding model can match it from the query itself.

Better:"Senior SWE based in Santa Clara, California, building Kubernetes-native infra"

[ Tip 02 ]

Describe the kind of company alongside the role.

Stage, size, and industry character match through the company entity linkage on every job.

Better:"Senior Director of Supply Chain in Pharmaceuticals, small manufacturing company with global ambition"

[ Tip 03 ]

Spell out technical background and what they want next.

For agent and copilot workflows, give the model both the candidate's current state and their target.

Better:"5+ years quantitative analysis at financial firms, expertise in PyTorch, ready to lead and mentor juniors in NYC"

[ Tip 04 ]

Use Neural Search when filters matter.

Vector Search accepts lexical parameters but does not currently honor them. For 30-day windows, salary floors, industry, or location filtering to actually narrow results, use Neural Search with both vector and lexical blocks.

Recommended:Production search backends · scheduled agents · filtered AI workflows

vs. Alternatives

Why teams choose Hirebase over generic
search APIs.

vs. building it yourself with general-purpose embeddings and a vector DB, or buying keyword-only job posting feeds.

Keyword-Only Job Posting APIs

Aggregator-derived feeds

DIY Embedding Stack

Generic embeddings + vector DB

Hirebase Semantic Search

Job-tuned, hosted, single API

Embedding model

None — keyword matching only

General-purpose (text-embedding-3, etc.)

socrates v2 — tuned on millions of job postings

Job data source

Mostly aggregator feeds, recycled

Bring your own — you build the crawler

Direct from 300k+ company career pages and 80+ ATS platforms

Setup time

API call

Weeks — crawl, encode, embed, index, host

Five minutes — page 1 free, no signup

Hybrid filtering

Filters work; no semantic layer

You implement re-ranking yourself

Neural Search: vector + lexical in one request

Domain fidelity

Term-level matching only

"ML engineer" matches "ML engineer" — flattens role distinctions

Preserves role, seniority, function, skill relationships

Index freshness

24–72 hour lag typical

You maintain it — re-encoding cost grows with scale

Most postings indexed in <1hr · embeddings refreshed automatically

Latency

Varies; usually fine

Embedding round-trip + vector DB query

Sub-50ms median — single hop

Pricing posture

Cheap, often unusable

Embedding fees + vector DB fees + crawler infra

Free tier · usage-based · no per-token costs

Common Questions

Frequently asked questions about the
Semantic Search API.

Get Started

Search by meaning, not keywords.

Page 1 is free. Run a real semantic search in your terminal in 60 seconds.

Get API Key Read Docs Talk to us

Semantic search API for jobs, built on direct-from-source data