🔍

PostgreSQL Full-Text Search Functions

Complete reference for PostgreSQL full-text search functions covering tsvector document vectors, tsquery query parsing, relevance ranking, and headline generation. Includes GIN index usage, text search configuration, and dictionary-based normalisation. Updated for PostgreSQL 16.

9 functions

Jump to function

phraseto_tsquery plainto_tsquery setweight to_tsquery to_tsvector ts_headline ts_rank tsvector || tsvector websearch_to_tsquery

What are PostgreSQL Full-Text Search Functions?

PostgreSQL full-text search functions convert documents and queries into tsvector and tsquery types for efficient lexeme-based searching. to_tsvector() normalises text into searchable lexemes, to_tsquery() parses a search query, ts_rank() provides relevance scoring, and ts_headline() generates highlighted result snippets. Full-text search with GIN indexes is significantly faster than ILIKE or regex for large text datasets.

phraseto_tsquery

PG 8.3+→ tsquery

Converts text to a tsquery that requires the words to appear in sequence (phrase search) using the <-> operator.

DeveloperData EngDBA

Signatures

phraseto_tsquery ( query text ) → tsquery

phraseto_tsquery ( config regconfig, query text ) → tsquery

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration
query	text	Phrase to search for

Examples

sql

SELECT phraseto_tsquery('english', 'quick brown fox');

→'quick' <-> 'brown' <-> 'fox'

sql

SELECT * FROM articles WHERE tsv @@ phraseto_tsquery('english', 'machine learning');

→Articles with 'machine' immediately followed by 'learning'

Words must appear in order with no gaps

sql

SELECT phraseto_tsquery('english', 'full text search');

→'full' <-> 'text' <-> 'search'

Only matches documents with these words in order

sql

SELECT * FROM docs WHERE to_tsvector('english', content) @@ phraseto_tsquery('english', 'machine learning');

→(docs containing exact phrase)

⚠Anti-Pattern— Using plainto_tsquery when phrase order matters

`plainto_tsquery('english', 'machine learning')` produces `'machin' & 'learn'` — it matches documents with both words anywhere, not necessarily adjacent. Use `phraseto_tsquery` when the exact sequence of words is required.

✓ Instead: -- Bad: matches docs with 'machine' and 'learning' anywhere, in any order SELECT * FROM docs WHERE tsv @@ plainto_tsquery('english', 'machine learning'); -- Good: requires adjacent sequence SELECT * FROM docs WHERE tsv @@ phraseto_tsquery('english', 'machine learning');

Use `phraseto_tsquery` when the user is searching for a specific phrase rather than individual words. This ensures 'machine learning' matches the compound term, not documents with 'machine' and 'learning' far apart.

example

SELECT * FROM docs WHERE tsv @@ phraseto_tsquery('english', 'primary key constraint');

→Documents with the exact phrase 'primary key constraint'

Common Gotchas

⚠

Sequences don't roll back — gaps appear after any aborted transaction

SERIAL/SEQUENCE values are consumed even if the transaction rolls back. You'll see gaps in IDs — this is expected and correct behavior.

plainto_tsquery

PG 8.3+→ tsquery

Converts plain text to a tsquery, treating all words as an implicit AND. Safe for user-supplied input — no special syntax needed.

DeveloperData EngDBA

Signatures

plainto_tsquery ( query text ) → tsquery

plainto_tsquery ( config regconfig, query text ) → tsquery

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration
query	text	Plain text search terms (spaces = AND)

Examples

sql

SELECT plainto_tsquery('english', 'quick brown fox');

→'quick' & 'brown' & 'fox'

sql

SELECT * FROM docs WHERE body_tsv @@ plainto_tsquery('english', $1);

→Full text search from user input

Stop words removed, remaining terms ANDed

sql

SELECT plainto_tsquery('english', 'the quick brown fox');

→'quick' & 'brown' & 'fox'

Ranked full-text search from user input

sql

SELECT title FROM docs WHERE to_tsvector('english', content) @@ plainto_tsquery('english', $1) ORDER BY ts_rank(to_tsvector('english', content), plainto_tsquery('english', $1)) DESC;

→(ranked search results)

⚠Anti-Pattern— Using plainto_tsquery when phrase or OR search is needed

`plainto_tsquery` always treats all words as AND — users cannot express 'cat OR dog' or exact phrases. Offer `websearch_to_tsquery` for search boxes where users expect Google-style syntax.

✓ Instead: -- Bad: user types 'cat OR dog' → treated as AND, finds nothing with all three words SELECT * FROM docs WHERE tsv @@ plainto_tsquery('english', 'cat OR dog'); -- Parsed as: 'cat' & 'or' & 'dog' (or just 'cat' & 'dog' if 'or' is stop word) -- Good: respects OR keyword SELECT * FROM docs WHERE tsv @@ websearch_to_tsquery('english', 'cat OR dog');

`plainto_tsquery` is ideal for basic search boxes where users type words separated by spaces. For more advanced syntax (quoted phrases, minus for exclusion), use `websearch_to_tsquery`.

example

SELECT title, ts_rank(search_vector, q) AS rank FROM articles, plainto_tsquery('english', $1) AS q WHERE search_vector @@ q ORDER BY rank DESC LIMIT 10;

→Top 10 ranked search results

setweight

PG 8.3+→ tsvector

Assigns a weight ('A', 'B', 'C', or 'D') to each lexeme in a tsvector. Used to give different importance to different parts of a document.

DeveloperData EngDBA

Signature

setweight ( tsvector, "char" ) → tsvector

Parameters

Parameter	Type	Description
tsvector	tsvector	Vector to assign weights to
weight_label	"char"	'A' (highest) through 'D' (lowest, default)

Examples

sql

SELECT setweight(to_tsvector('english', 'hot chocolate'), 'A');

→'chocol':2A 'hot':1A

sql

SELECT setweight(to_tsvector(title), 'A') || setweight(to_tsvector(tags_str), 'B') || setweight(to_tsvector(body), 'C') FROM posts;

→Multi-field weighted vector

Title matches will rank higher than body matches

sql

SELECT setweight(to_tsvector('english', title), 'A') || setweight(to_tsvector('english', body), 'C') AS doc FROM articles;

→Combined tsvector with title weighted higher

Maintain weighted tsvector column for fast search

sql

UPDATE articles SET search_vec = setweight(to_tsvector('english', coalesce(title,'')), 'A') || setweight(to_tsvector('english', coalesce(tags,'')), 'B') || setweight(to_tsvector('english', coalesce(body,'')), 'C');

→(rows updated)

⚠Anti-Pattern— Applying setweight after concatenation — weights are lost

Calling `setweight(tsvector1 || tsvector2, 'A')` applies the weight to the already-concatenated result, overwriting any individual weights. Apply `setweight` to each part before concatenating.

✓ Instead: -- Bad: sets 'A' on everything — individual weights lost SELECT setweight(to_tsvector('english', title) || to_tsvector('english', body), 'A'); -- Good: weight each part independently, then concatenate SELECT setweight(to_tsvector('english', title), 'A') || setweight(to_tsvector('english', body), 'B');

For multi-column search with weights, compute once in a generated column: `GENERATED ALWAYS AS (setweight(to_tsvector(title), 'A') || setweight(to_tsvector(body), 'B')) STORED`. Index the generated column with GIN.

example

CREATE INDEX ON articles USING GIN(search_vector);
-- Generated column computed above
SELECT * FROM articles WHERE search_vector @@ q ORDER BY ts_rank(search_vector, q) DESC;

→Indexed weighted full-text search

to_tsquery

PG 8.3+→ tsquery

Converts a search query text to a tsquery, applying stemming and stop word removal. Input must use tsquery operators (&, |, !, <->).

DeveloperData EngDBA

Signatures

to_tsquery ( query text ) → tsquery

to_tsquery ( config regconfig, query text ) → tsquery

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration
query	text	Search query with operators: & (AND), \| (OR), ! (NOT), <-> (phrase)

Examples

sql

SELECT to_tsquery('english', 'jumping & dogs');

→'jump' & 'dog'

sql

SELECT to_tsquery('english', 'fat | cat');

→'fat' | 'cat'

sql

SELECT to_tsquery('cat & !dog');

→'cat' & !'dog'

Boolean combination with negation

sql

SELECT to_tsquery('english', 'database & (postgres | mysql) & !oracle');

→'databas' & ( 'postgr' | 'mysql' ) & !'oracl'

⚠Anti-Pattern— Passing raw user input to to_tsquery

`to_tsquery` requires strict tsquery syntax — a bare space or unsupported character raises an error at runtime. Never pass unvalidated user input to it. Use `websearch_to_tsquery` or `plainto_tsquery` for user-facing search.

✓ Instead: -- Bad: user types 'dogs running' → ERROR: syntax error in tsquery SELECT * FROM articles WHERE tsv @@ to_tsquery('english', user_input); -- Good: safe for arbitrary user input SELECT * FROM articles WHERE tsv @@ websearch_to_tsquery('english', user_input);

`to_tsquery` requires proper tsquery syntax — passing raw user input will error on spaces and special chars. Use `plainto_tsquery` (implicit AND) or `websearch_to_tsquery` (Google-like syntax) for user input.

example

SELECT * FROM articles WHERE search_vector @@ websearch_to_tsquery('english', 'running fast dogs');

→Articles about running, fast, and dogs

to_tsvector

PG 8.3+→ tsvector

Converts a text document to a tsvector, normalizing words to their lexeme forms and removing stop words.

DeveloperData EngDBA

Signatures

to_tsvector ( document text ) → tsvector

to_tsvector ( config regconfig, document text ) → tsvector

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration (e.g., 'english', 'simple'). Defaults to the server default configuration.
document	text	Text document to parse and normalize

Examples

sql

SELECT to_tsvector('english', 'The quick brown fox jumps over the lazy dog');

→'brown':3 'dog':9 'fox':4 'jump':5 'lazi':8 'quick':2

sql

SELECT to_tsvector('simple', 'Hello World');

→'hello':1 'world':2

Combines title and body into one searchable tsvector

sql

SELECT to_tsvector('english', title || ' ' || body) FROM articles WHERE id = 1;

→'articl':3,8 'content':5 'introduct':2 'sampl':1

Rank documents by relevance to search term

sql

SELECT ts_rank(to_tsvector('english', content), to_tsquery('english', 'postgres')) FROM docs ORDER BY 1 DESC;

→(rows ranked by relevance)

⚠Anti-Pattern— Computing to_tsvector at query time instead of storing it

Calling `to_tsvector` inside a WHERE clause forces PostgreSQL to re-parse and normalize every row on every query, making full-table scans the only option. Store the vector in a generated column and index it instead.

✓ Instead: -- Bad: recomputed on every query SELECT * FROM articles WHERE to_tsvector('english', body) @@ plainto_tsquery('search'); -- Good: generated column + GIN index ALTER TABLE articles ADD COLUMN tsv tsvector GENERATED ALWAYS AS (to_tsvector('english', coalesce(body,''))) STORED; CREATE INDEX articles_tsv_gin ON articles USING GIN(tsv); SELECT * FROM articles WHERE tsv @@ plainto_tsquery('english', 'search');

Pass the language explicitly: `to_tsvector('english', ...)`. The default config may not match your content language, leading to poor search quality. Use 'simple' to skip stemming for proper nouns or code.

example

ALTER TABLE articles ADD COLUMN search_vector tsvector GENERATED ALWAYS AS (to_tsvector('english', coalesce(title,'') || ' ' || coalesce(body,''))) STORED;

→Auto-maintained search vector column

Common Gotchas

⚠

LIKE is case-sensitive; ILIKE is not — and LIKE is faster

LIKE 'hello%' will not match 'Hello'. Use ILIKE for case-insensitive pattern matching, but expect a performance cost.

⚠

Arrays are 1-indexed in PostgreSQL, not 0-indexed

PostgreSQL arrays start at index 1 by default. array[0] returns NULL, not the first element — silently wrong.

⚠

Implicit type casts in indexes — your index may not be used

WHERE col = 5 may not use an index on col (text type) because the integer 5 is cast to text, preventing index use.

ts_headline

PG 8.3+→ text

Generates a highlighted summary of a document showing search term matches in context (like a search snippet).

DeveloperData EngDBA

Signatures

ts_headline ( document text, query tsquery [, options text] ) → text

ts_headline ( config regconfig, document text, query tsquery [, options text] ) → text

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration
document	text	Original document text
query	tsquery	The search query
options	text	Comma-separated options: StartSel, StopSel, MaxWords, MinWords, ShortWord, HighlightAll, MaxFragments, FragmentDelimiter

Examples

sql

SELECT ts_headline('english', body, to_tsquery('english', 'database'), 'StartSel=<b>, StopSel=</b>') FROM docs WHERE id = 1;

→"...uses a database to store..."

sql

SELECT ts_headline('english', content, q, 'MaxFragments=2, FragmentDelimiter=" ... "') FROM articles, to_tsquery('english', 'search index') q WHERE tsv @@ q;

→Search snippet with highlights

Multiple highlighted fragments with custom length

sql

SELECT ts_headline('english', content, to_tsquery('PostgreSQL'), 'MaxFragments=3,MaxWords=15') FROM docs WHERE id = 1;

→'...working with PostgreSQL indexes...queries in PostgreSQL...'

HTML-ready highlighting for web display

sql

SELECT title, ts_headline('english', body, websearch_to_tsquery('english', 'full text search'), 'StartSel=<mark>,StopSel=</mark>') FROM docs WHERE body_vec @@ websearch_to_tsquery('english', 'full text search');

→Title with highlighted matches

⚠Anti-Pattern— Running ts_headline on the full document body without limiting result set first

`ts_headline` is CPU-intensive — it re-parses and tokenises the raw document text. Applying it to thousands of rows returned before a LIMIT multiplies the cost. Always paginate or LIMIT before calling `ts_headline`.

✓ Instead: -- Bad: ts_headline runs on every matching row before LIMIT SELECT ts_headline('english', body, q) FROM articles, ... WHERE tsv @@ q; -- Good: limit first using a CTE or subquery, then generate headlines WITH ranked AS ( SELECT id, ts_rank(tsv, q) AS rank FROM articles, websearch_to_tsquery('english', $1) AS q WHERE tsv @@ q ORDER BY rank DESC LIMIT 10 ) SELECT a.title, ts_headline('english', a.body, websearch_to_tsquery('english', $1), 'StartSel=, StopSel=, MaxFragments=2') FROM articles a JOIN ranked r ON a.id = r.id ORDER BY r.rank DESC;

Set `MaxFragments=2` or more to get multiple context fragments like Google search snippets. Use `FragmentDelimiter` to set the separator (e.g., ' ... '). Always wrap match terms in HTML tags for display.

example

SELECT ts_headline('english', body, q, 'StartSel=<mark>, StopSel=</mark>, MaxFragments=3') FROM articles, plainto_tsquery('english', $1) AS q WHERE tsv @@ q;

→HTML-ready search snippets with highlighted terms

ts_rank

PG 8.3+→ float4

Calculates a relevance score for a tsvector vs a tsquery. ts_rank_cd uses cover density, which rewards compact term clustering.

DeveloperData EngDBA

Signatures

ts_rank ( [weights float4[], ] tsvector, tsquery [, normalization integer] ) → float4

ts_rank_cd ( [weights float4[], ] tsvector, tsquery [, normalization integer] ) → float4

Parameters

Parameter	Type	Description
weights	float4[]	Optional weights for D, C, B, A label classes [D, C, B, A]
tsvector	tsvector	Document vector
tsquery	tsquery	Search query
normalization	integer	Bitmask controlling length normalization (0=no norm, 1=doc length, 2=unique words, etc.)

Examples

sql

SELECT ts_rank(to_tsvector('english', body), to_tsquery('english', 'cat')) FROM docs ORDER BY ts_rank DESC;

→Ranked results by relevance

sql

SELECT ts_rank_cd(search_vector, query) AS rank FROM articles, websearch_to_tsquery('english', 'database index') AS query WHERE search_vector @@ query ORDER BY rank DESC;

→Cover-density ranked results

Rank and sort by relevance in one query

sql

SELECT title, ts_rank(to_tsvector('english', content), q) AS rank FROM docs, to_tsquery('english', 'performance & index') q WHERE to_tsvector('english', content) @@ q ORDER BY rank DESC LIMIT 5;

→(top 5 most relevant docs)

Custom weights: D=0.1, C=0.2, B=0.4, A=1.0

sql

SELECT ts_rank('{0.1, 0.2, 0.4, 1.0}', to_tsvector('english', body), to_tsquery('english', 'search')) FROM docs;

→(float relevance scores)

⚠Anti-Pattern— Calling ts_rank without a WHERE @@ clause — ranking every row

`ts_rank` does not filter rows — it only scores them. Without a `WHERE tsv @@ query` clause, PostgreSQL computes a rank for every row in the table (including non-matching rows), then sorts all of them. Always pair `ts_rank` with a `@@` filter.

✓ Instead: -- Bad: scores and sorts every row in the table SELECT title, ts_rank(tsv, to_tsquery('english', 'database')) AS rank FROM articles ORDER BY rank DESC; -- Good: filter first, then rank SELECT title, ts_rank(tsv, q) AS rank FROM articles, to_tsquery('english', 'database') AS q WHERE tsv @@ q ORDER BY rank DESC LIMIT 20;

Without normalization, longer documents rank higher just because they contain more words. Use `ts_rank(vec, query, 1)` (divide by document length) or `ts_rank(vec, query, 2)` (divide by unique lexemes) to normalize.

example

SELECT title, ts_rank(tsv, q, 2) AS rank FROM articles, to_tsquery('english', 'postgresql') AS q WHERE tsv @@ q ORDER BY rank DESC LIMIT 5;

→Top 5 most relevant articles, normalized by unique words

tsvector || tsvector

PG 8.3+→ tsvector

Concatenates two tsvectors, combining their lexemes and positions. Useful for building vectors from multiple columns.

DeveloperData EngDBA

Signature

tsvector || tsvector → tsvector

Parameters

Parameter	Type	Description
tsvector1	tsvector	First tsvector
tsvector2	tsvector	Second tsvector to concatenate

Examples

sql

SELECT to_tsvector('fat cat') || to_tsvector('sat on a mat');

→'cat':2 'fat':1 'mat':6 'sat':3

sql

SELECT setweight(to_tsvector('english', title), 'A') || setweight(to_tsvector('english', body), 'B') FROM articles;

→Title words have higher weight than body

Title words weighted A, body words weighted C

sql

SELECT setweight(to_tsvector('english', title), 'A') || setweight(to_tsvector('english', body), 'C') AS full_doc FROM articles WHERE id = 1;

→'articl':1A 'introduc':4C 'sampl':3C 'text':2A

Maintain weighted search column for fast GIN-indexed search

sql

UPDATE articles SET search_vec = setweight(to_tsvector('english', coalesce(title,'')), 'A') || setweight(to_tsvector('english', coalesce(body,'')), 'C') WHERE id = ANY($1);

→(rows updated)

⚠Anti-Pattern— Concatenating tsvectors at query time instead of storing in a generated column

Building `setweight(...) || setweight(...)` inside the WHERE clause forces every concatenation to happen at query time on every row, preventing GIN index use. Store the pre-computed concatenation in a generated column.

✓ Instead: -- Bad: concatenation at query time — full table scan, no index benefit SELECT * FROM articles WHERE (setweight(to_tsvector('english', title), 'A') || setweight(to_tsvector('english', body), 'B')) @@ plainto_tsquery('english', $1); -- Good: store and index the concatenated vector ALTER TABLE articles ADD COLUMN tsv tsvector GENERATED ALWAYS AS ( setweight(to_tsvector('english', coalesce(title,'')), 'A') || setweight(to_tsvector('english', coalesce(body,'')), 'B') ) STORED; CREATE INDEX articles_tsv_gin ON articles USING GIN(tsv);

Use `setweight(to_tsvector(title), 'A') || setweight(to_tsvector(body), 'B')` to build a weighted vector. ts_rank will score title matches higher than body matches automatically.

example

SELECT setweight(to_tsvector('english', coalesce(title,'')), 'A') || setweight(to_tsvector('english', coalesce(description,'')), 'B') AS tsv FROM products;

→Weighted tsvector: title terms outrank description terms

websearch_to_tsquery

PG 11+→ tsquery

Converts a Google-like search string to a tsquery. Supports quoted phrases, unquoted AND, OR, and minus for NOT.

DeveloperData EngDBA

Signatures

websearch_to_tsquery ( query text ) → tsquery

websearch_to_tsquery ( config regconfig, query text ) → tsquery

Parameters

Parameter	Type	Description
config	regconfig	Text search configuration
query	text	Search query with web-search syntax: OR for OR, -word for NOT, "phrase" for phrase

Examples

sql

SELECT websearch_to_tsquery('english', 'quick brown fox');

→'quick' & 'brown' & 'fox'

sql

SELECT websearch_to_tsquery('english', 'cat OR dog');

→'cat' | 'dog'

sql

SELECT websearch_to_tsquery('english', '"quick fox"');

→'quick' <-> 'fox'

Google-style: - negates, quotes require phrase proximity

sql

SELECT websearch_to_tsquery('simple', 'postgres -mysql "full text"');

→'postgres' & !'mysql' & 'full' <-> 'text'

⚠Anti-Pattern— Using to_tsquery instead of websearch_to_tsquery for user-facing search inputs

`to_tsquery` raises an error on plain user input (spaces, special characters). For any search box that accepts free-form text, `websearch_to_tsquery` is the safe, correct choice — it never throws on unexpected input.

✓ Instead: -- Bad: crashes if user types 'quick brown fox' (space not valid in to_tsquery) SELECT * FROM articles WHERE tsv @@ to_tsquery('english', user_input); -- Good: handles any user input gracefully SELECT * FROM articles WHERE tsv @@ websearch_to_tsquery('english', user_input);

`websearch_to_tsquery` is the most user-friendly query parser — it handles common web search idioms without the strict syntax requirements of `to_tsquery`. Never errors on invalid input; just ignores unsupported tokens.

example

SELECT title FROM posts WHERE tsv @@ websearch_to_tsquery('english', user_input) ORDER BY ts_rank(tsv, websearch_to_tsquery('english', user_input)) DESC LIMIT 20;

→Ranked search results from user-friendly input

PostgreSQL Full-Text Search Functions

Jump to function

What are PostgreSQL Full-Text Search Functions?

phraseto_tsquery

Signatures

Parameters

Examples

Common Gotchas

plainto_tsquery

Signatures

Parameters

Examples

setweight

Signature

Parameters

Examples

to_tsquery

Signatures

Parameters

Examples

to_tsvector

Signatures

Parameters

Examples

Common Gotchas

ts_headline

Signatures

Parameters

Examples

ts_rank

Signatures

Parameters

Examples

tsvector || tsvector

Signature

Parameters

Examples

websearch_to_tsquery

Signatures

Parameters

Examples

Related PostgreSQL Categories