Lucene-CuVS Benchmarks

Prerequisites

Before running

Build libcuvs libraries and CuVS Java API

(For now, please comment out cuvsRMMPoolMemoryResourceEnable in both CAGRA and Bruteforce build index methods in the C wrapper)

git clone [email protected]:rapidsai/cuvs.git \
&& cd cuvs \
&& git checkout branch-25.02 \
&& ./build.sh libcuvs java

Build Lucene-CuVS

git clone [email protected]:SearchScale/lucene.git \
&& cd lucene \
&& git checkout cuvs-integration-main \
&& ./gradlew compileJava mavenToLocal

Download the Wikipedia Dataset (5M vectors x 2048 dimensions), queries (100 x 2048 dimensions), and groundtruth (100 x 64 topk)

wget https://accounts.searchscale.com/datasets/wikipedia/ground_truth_100x64.csv \
&& wget https://accounts.searchscale.com/datasets/wikipedia/queries_100.csv.mapdb \
&& wget https://accounts.searchscale.com/datasets/wikipedia/wiki_dump_5Mx2048D.csv.gz.mapdb

Running Manually

Steps:

Add your benchmark job configuration in the jobs.json file
do ./benchmarks.sh jobs.json
If saveResultsOnDisk is set as true (in jobs.json) then you can find your benchmark results in the results folder. For each successful benchmark run, two files are created ${benchmark_id}__benchmark_results_${timestamp}.json and ${benchmark_id}__neighbors_${timestamp}.csv

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
src		src
utils		utils
.gitignore		.gitignore
README.md		README.md
benchmarks.sh		benchmarks.sh
jobs.json		jobs.json
jobs_sift.json		jobs_sift.json
jobs_sift_small.json		jobs_sift_small.json
jobs_wikipedia.json		jobs_wikipedia.json
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lucene-CuVS Benchmarks

Prerequisites

Before running

Build libcuvs libraries and CuVS Java API

Build Lucene-CuVS

Download the Wikipedia Dataset (5M vectors x 2048 dimensions), queries (100 x 2048 dimensions), and groundtruth (100 x 64 topk)

Running Manually

About

Releases

Packages

Languages

SearchScale/vectorsearch-benchmarks

Folders and files

Latest commit

History

Repository files navigation

Lucene-CuVS Benchmarks

Prerequisites

Before running

Build libcuvs libraries and CuVS Java API

Build Lucene-CuVS

Download the Wikipedia Dataset (5M vectors x 2048 dimensions), queries (100 x 2048 dimensions), and groundtruth (100 x 64 topk)

Running Manually

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages