Ukusesha kweVekhtha kuletha ngempumelelo ukufana kwe-semantic ukuze kubuyiswe isizukulwane esithuthukisiwe, kodwa akwenzi kahle ngokusesha kwamagama angukhiye amafushane noma amagama okusesha angaphandle kwesizinda. Ukwengeza ukubuyiswa kwe-vector ngokusesha kwamagama angukhiye njenge-BM25 nokuhlanganisa imiphumela ne-reranker kuba indlela evamile yokuthola okungcono kakhulu kuyo yomibili imihlaba.
Ama-Reranker amamodeli e-ML athatha isethi yemiphumela yosesho futhi ayihlele kabusha ukuze kuthuthukiswe ukuhambisana. Bahlola umbuzo obhangqwe nomphumela ngamunye wekhandidethi ngemininingwane, obiza kakhulu ngokwekhompyutha kodwa okhiqiza imiphumela enembe kakhulu kunezindlela zokubuyisa ezilula zizodwa. Lokhu kungenziwa njengesigaba sesibili phezu kokusesha okukodwa (khipha imiphumela eyi-100 ekusesheni kwevekhtha, bese ucela ohlela kabusha ukuthi akhombe okungu-10 okuphezulu) noma, kaningi, ukuhlanganisa imiphumela evela ezinhlotsheni ezihlukene zokusesha; kulokhu, usesho lwe-vector nokusesha igama elingukhiye.
Kodwa kuhle kangakanani abahlaziyi be-off-the-shelf? Ukuze ngithole, ngihlole ama-renkers ayisithupha embhalweni ovela ku-
Sihlole lezi renkers:
Abahlaziyi banikezwe imiphumela ephezulu engama-20 evela kokubili i-DPR ne-BM25, futhi i-NDCG@5 ehlaziywe yahlolwa.
Emiphumeleni, usesho lwevekhtha eluhlaza (okushumekiwe olusuka kumodeli ye-bge-m3) lubhalwe ukuthi dpr (ukubuyisa isiqephu esiminyene). I-BGE-m3 ikhethelwe ukubala okushumekiwe ngoba yilokho okwasetshenziswa ababhali be-ColPali njengesisekelo.
Nansi idatha emayelana nokuhambisana (NDCG@5):
Futhi nakhu ukuthi bashesha kangakanani ekuhleleni kabusha usesho kudathasethi ye-arxiv; I-latency ilingana nobude bedokhumenti. Lokhu ukubambezeleka kwe-graphing, ngakho okuphansi kungcono. Imodeli ye-bge ezibambele yona yasetshenziswa kuzo zombili i-NVIDIA 3090 kusetshenziswa ikhodi elula kakhulu ephakanyiswe isuka
Okokugcina, nakhu ukuthi kubiza malini ngemodeli ngayinye ukuphinda umise ukusesha okucishe kube ngu-3,000 kuwo wonke amadathasethi ayisithupha. Izintengo ezihlangene ngokusesha ngakunye (nezinkokhelo ezengeziwe zamadokhumenti amade), kanti amanye amanani ethokheni ngayinye.
I-RRF yengeza inani elincane noma elingekho ezimeni zokucinga ezixubile; engxenyeni yamadathasethi, lenze kabi kakhulu kune-BM25 noma i-DPR iyodwa. Ngokuphambene, bonke abahlaziyi abasekelwe ku-ML abahloliwe balethe ukuthuthuka okunengqondo ngaphezu kwevekhtha emsulwa noma ukusesha kwegama elingukhiye, i-Voyage rerank-2 ibeka ibha ukuze ihlobane.
Ukuhwebelana kusekhona: ukunemba okuphakeme okuvela ku-Voyage relank-2, ukucutshungulwa okusheshayo okuvela ku-Cohere, noma ukusebenza okuqinile okumaphakathi okuvela kuJina noma kumodeli ye-Voyage lite. Ngisho ne-reranker yomthombo ovulekile ye-BGE, ngenkathi ilandela izinketho zokuthengisa, yengeza inani elibalulekile emaqenjini akhetha ukuzibamba.
Njengoba amamodeli esisekelo eqhubeka nokuthuthuka, singalindela ukusebenza okungcono nakakhulu. Kodwa abahlaziyi be-ML banamuhla sebevuthwe ngokwanele ukuthi bangasebenza ngokuzethemba kukho konke okuqukethwe ngezilimi eziningi.
Ngu-Jonathan Ellis, DataStax