Class HnswGraphBuilder

java.lang.Object
org.apache.lucene.util.hnsw.HnswGraphBuilder
All Implemented Interfaces:
HnswBuilder
Direct Known Subclasses:
HnswConcurrentMergeBuilder.ConcurrentMergeWorker, InitializedHnswGraphBuilder

public class HnswGraphBuilder extends Object implements HnswBuilder
Builder for HNSW graph. See HnswGraph for a gloss on the algorithm and the meaning of the hyper-parameters.
  • Field Details

  • Constructor Details

    • HnswGraphBuilder

      protected HnswGraphBuilder(RandomVectorScorerSupplier scorerSupplier, int M, int beamWidth, long seed, int graphSize) throws IOException
      Reads all the vectors from vector values, builds a graph connecting them by their dense ordinals, using the given hyperparameter settings, and returns the resulting graph.
      Parameters:
      scorerSupplier - a supplier to create vector scorer from ordinals.
      M - – graph fanout parameter used to calculate the maximum number of connections a node can have – M on upper layers, and M * 2 on the lowest level.
      beamWidth - the size of the beam search to use when finding nearest neighbors.
      seed - the seed for a random number generator used during graph construction. Provide this to ensure repeatable construction.
      graphSize - size of graph, if unknown, pass in -1
      Throws:
      IOException
    • HnswGraphBuilder

      protected HnswGraphBuilder(RandomVectorScorerSupplier scorerSupplier, int M, int beamWidth, long seed, OnHeapHnswGraph hnsw) throws IOException
      Throws:
      IOException
    • HnswGraphBuilder

      protected HnswGraphBuilder(RandomVectorScorerSupplier scorerSupplier, int M, int beamWidth, long seed, OnHeapHnswGraph hnsw, HnswLock hnswLock, HnswGraphSearcher graphSearcher) throws IOException
      Reads all the vectors from vector values, builds a graph connecting them by their dense ordinals, using the given hyperparameter settings, and returns the resulting graph.
      Parameters:
      scorerSupplier - a supplier to create vector scorer from ordinals.
      M - – graph fanout parameter used to calculate the maximum number of connections a node can have – M on upper layers, and M * 2 on the lowest level.
      beamWidth - the size of the beam search to use when finding nearest neighbors.
      seed - the seed for a random number generator used during graph construction. Provide this to ensure repeatable construction.
      hnsw - the graph to build, can be previously initialized
      Throws:
      IOException
  • Method Details