Approximate nearest neighbor methods and vector models

jj · February 26, 2021

SS-hashtag-recommendation-project


Source: Spotify engineering lead, SlideShare

Building an Annoy index
- start with the point set
- split it in 2 halves
- split again (until at most k items remain in each leaf; the tree then takes about n/k memory instead of n)
- the result is a binary tree
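
The build steps above can be sketched in a few lines of plain Python. This is only a rough illustration, not Annoy's actual code: the names `split_plane`, `build_tree`, and `leaves` are made up here, and choosing the split as the hyperplane halfway between two randomly sampled points is an assumption about how "split it in 2 halves" is done.

```python
import math
import random

def split_plane(a, b):
    """Hyperplane halfway between a and b, as (unit normal, offset)."""
    normal = [x - y for x, y in zip(a, b)]
    norm = math.sqrt(sum(w * w for w in normal))
    if norm == 0.0:
        return None  # a and b coincide, no usable hyperplane
    normal = [w / norm for w in normal]
    mid = [(x + y) / 2 for x, y in zip(a, b)]
    return normal, sum(w * m for w, m in zip(normal, mid))

def build_tree(points, ids, k, rng):
    """Recursively split ids until every leaf holds at most k items."""
    if len(ids) <= k:
        return {"leaf": list(ids)}
    a, b = (points[i] for i in rng.sample(ids, 2))
    plane = split_plane(a, b)
    left, right = [], []
    if plane is None:
        # Fallback for duplicate points: cut the list in half arbitrarily.
        half = len(ids) // 2
        left, right = ids[:half], ids[half:]
    else:
        normal, offset = plane
        for i in ids:
            m = sum(w * x for w, x in zip(normal, points[i])) - offset
            (left if m < 0 else right).append(i)
    return {"plane": plane,
            "left": build_tree(points, left, k, rng),
            "right": build_tree(points, right, k, rng)}

def leaves(node):
    """List the id lists stored in all leaves."""
    if "leaf" in node:
        return [node["leaf"]]
    return leaves(node["left"]) + leaves(node["right"])

rng = random.Random(0)
points = [(rng.random(), rng.random()) for _ in range(200)]
tree = build_tree(points, list(range(200)), k=10, rng=rng)
leaf_sizes = [len(leaf) for leaf in leaves(tree)]
```

Every entry of `leaf_sizes` is at most k, and each point lands in exactly one leaf.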
Search
- the closest point isn't necessarily in the same leaf of the binary tree as the query
- two points that are really close may end up on different sides of a split
→ Search both sides of a split if the query is close to it
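
A tiny hand-built example shows why going down both sides matters. Everything here is hypothetical (the four points, the single split at x = 0, the `search` and `nearest` helpers, and the fixed `threshold`); as far as I understand, Annoy itself uses a priority queue over several trees rather than a fixed threshold, so treat this as a simplified sketch.

```python
import math

# Four 2-D points and a one-level tree that splits at x = 0.
points = {0: (-0.1, 0.0), 1: (-2.0, 1.0), 2: (0.9, 0.0), 3: (3.0, -1.0)}
tree = {"normal": (1.0, 0.0), "offset": 0.0,
        "left": {"leaf": [0, 1]}, "right": {"leaf": [2, 3]}}

def search(node, q, threshold):
    """Collect candidate ids; also visit the far side when q is near the split."""
    if "leaf" in node:
        return list(node["leaf"])
    # Signed distance from q to the splitting hyperplane (normal has unit length).
    m = sum(w * x for w, x in zip(node["normal"], q)) - node["offset"]
    near, far = ((node["left"], node["right"]) if m < 0
                 else (node["right"], node["left"]))
    candidates = search(near, q, threshold)
    if abs(m) <= threshold:
        candidates += search(far, q, threshold)
    return candidates

def nearest(node, q, threshold):
    """Return the candidate id closest to q by Euclidean distance."""
    return min(search(node, q, threshold),
               key=lambda i: math.dist(points[i], q))

query = (0.2, 0.0)
one_side = nearest(tree, query, threshold=0.0)    # only the query's own leaf
both_sides = nearest(tree, query, threshold=0.5)  # crosses the nearby split
```

The query falls on the right side of the split, so searching one side only returns point 2 (distance 0.7), while allowing both sides finds point 0 (distance 0.3), which is the true nearest neighbor.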

pip install --user annoy

pip install pynndescent
