Sample notebook for merging hnsw graphs

Chenzhe_Jin · June 4, 2025, 1:09am

I am wondering whether or where I can find the sample code for this blog Speeding up merging of HNSW graphs - Elasticsearch Labs?

carly.richmond · June 4, 2025, 11:10am

Hi @Chenzhe_Jin,

Welcome! To confirm are you looking for a notebook containing the code for the experiment rather than the discussed pseudocode? Normally search labs accompanying notebooks are located in the elasticsearch-labs GitHub repo under the supporting-blog-content folder, but I expect there isn't one for this piece.

The author @Tom_Veasey and @mayya may be able to help.

mayya · June 4, 2025, 1:04pm

This change is incorporated in Lucene (a library that Elasticsearch uses underneath).

You can study the code in the following PR.

Chenzhe_Jin · June 4, 2025, 8:30pm

Thanks for your reply! I am also wondering if there is some code or instructions we can follow to reproduce the experiments in the blog and PR?

Chenzhe_Jin · June 5, 2025, 2:31am

Hi, I have another question when I read the code. Can your merge strategy support parallelism? When I read the code, I find there is no parallelism design. Thanks for your patience.

mayya · June 5, 2025, 4:01pm

No, currently there is no parallelism.
We go through all segment graphs and add them incrementally to the biggest graph.

Topic		Replies	Views
[ANN] SegmentSpy - Site Plugin to watch segments in realtime Elasticsearch	6	579	July 6, 2017
Segment Merge query Elasticsearch	3	550	July 5, 2017
Optimizing segment merge settings for high search throughput Elasticsearch	1	619	July 5, 2017
Hadoop documentation? Elasticsearch	2	229	July 6, 2017
New Site / Docs Elasticsearch	6	270	July 6, 2017

Sample notebook for merging hnsw graphs

Related topics