Using pgvector for finding similar users based on behavior

fresh take on pgvector. most "vector in postgres" content is about RAG / document search. i used it for a different thing: finding similar users based on their behavior, not their content.

context: our B2B SaaS has a feature where a user can ask "who else uses this similarly to me?" we wanted to surface other users whose interaction patterns matched theirs — for collaboration suggestions, template sharing, peer connections.

couldn't do this with traditional similarity (cosine on raw event counts). users have different volumes of activity, so "user with most events" was always the closest match. needed something that captured pattern, not magnitude.

what i shipped:

a vector embedding per user, computed from their last 30 days of behavior. the vector represents which actions they take in which proportions, normalized.

create extension if not exists vector;

create table user_behavior_vectors (
    user_id uuid primary key references auth.users(id),
    vector vector(64),
    computed_at timestamptz default now()
);

create index on user_behavior_vectors using hnsw (vector vector_cosine_ops);
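
a minimal sketch of the lookup this table and index are meant to serve: the 10 users closest to one given user. it assumes pgvector's cosine distance operator `<=>`; the uuid is a made-up placeholder, and the query itself is an illustration, not something from the original post.

```sql
-- hypothetical "who uses this like me?" lookup
-- the uuid below is a placeholder for the asking user's id
select v.user_id
from user_behavior_vectors v
where v.user_id <> '00000000-0000-0000-0000-000000000001'
order by v.vector <=> (
    -- the asking user's own behavior vector
    select t.vector
    from user_behavior_vectors t
    where t.user_id = '00000000-0000-0000-0000-000000000001'
)
limit 10;
```

ordering by the distance expression in ascending order with a limit is the query shape an hnsw index can serve. `<=>` returns cosine distance, so `1 - (a <=> b)` gives cosine similarity if a score is needed for display.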

the vector dimensions correspond to behaviors. our 64 dimensions are 64 distinct event types we track. each dimension's value is the normalized frequency of that event in the user's last 30 days.
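
a small worked example of the normalization, assuming "normalized frequency" means each event count divided by the user's total event count (which is what the visible part of the refresh function appears to do). the user labels and event names are invented for illustration, and the query needs plain postgres only.

```sql
-- two hypothetical users with the same mix of actions at very different volumes
with raw_counts(user_label, event_name, n) as (
    values
        ('light user', 'create_doc', 2),
        ('light user', 'share_doc',  1),
        ('light user', 'export_pdf', 1),
        ('heavy user', 'create_doc', 200),
        ('heavy user', 'share_doc',  100),
        ('heavy user', 'export_pdf', 100)
)
select user_label,
       event_name,
       -- share of this user's events that are of this type
       n::float / sum(n) over (partition by user_label) as freq
from raw_counts
order by user_label, event_name;
```

both users come out as create_doc 0.5, export_pdf 0.25, share_doc 0.25, so their vectors are identical even though one has 100x the activity. the dimension order still has to be fixed (alphabetical by event name in the post's function) so that position i means the same event type for every user.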

building the vectors (runs nightly via pg_cron):

create or replace function refresh_user_behavior_vectors()
returns void as $$
declare
    e_types text[];
begin
    -- get our 64 tracked event types
    -- (the limit sits in a subquery: placed after array_agg it would cap the
    -- single aggregated row, not the number of event types)
    select array_agg(name order by name) into e_types
    from (
        select name from event_type_registry
        where included_in_vector = true
        order by name
        limit 64
    ) as tracked;

    -- for each active user, compute their vector
    insert into user_behavior_vectors (user_id, vector, computed_at)
    select
        user_id,
        (select array_agg(coalesce(freq, 0)::real)::vector
         from unnest(e_types) as et
         left join lateral (
             select count(*)::float
                 / nullif((select count(*) from events e2
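
for the nightly run mentioned above, a minimal sketch of the pg_cron side, assuming the extension is available and using its `cron.schedule(job_name, schedule, command)` form. the job name and the 03:00 run time are made up; the post only says "nightly".

```sql
-- assumes pg_cron is installed and enabled for this database
create extension if not exists pg_cron;

select cron.schedule(
    'refresh-user-behavior-vectors',  -- hypothetical job name
    '0 3 * * *',                      -- every day at 03:00 (assumed time)
    $$select refresh_user_behavior_vectors()$$
);
```

scheduled jobs and their ids can be inspected in the `cron.job` table, and `cron.unschedule('refresh-user-behavior-vectors')` removes the job again.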

Replies (2)

Sounds complex. What’s your background?

No-Read-2843·5/31/2026, 8:23:05 AM

[ Removed by Reddit ]

Embarrassed-War9550·6/1/2026, 11:13:45 AM