Lupa

Izpis gradiva Pomoč

A- | A+ | Natisni
Naslov:Bridging the question–answer gap in retrieval-augmented generation : hypothetical prompt embeddings
Avtorji:ID Vake, Domen (Avtor)
ID Vičič, Jernej (Avtor)
ID Tošić, Aleksandar (Avtor)
Datoteke:.pdf RAZ_Vake_Domen_2025.pdf (1,41 MB)
MD5: A642E181DB07BC4D6B4A0B0058B6E8EE
 
URL https://ieeexplore.ieee.org/document/11080443
 
Jezik:Angleški jezik
Vrsta gradiva:Članek v reviji
Tipologija:1.01 - Izvirni znanstveni članek
Organizacija:FAMNIT - Fakulteta za matematiko, naravoslovje in informacijske tehnologije
Opis:Retrieval-Augmented Generation (RAG) systems synergize retrieval mechanisms with generative language models to enhance the accuracy and relevance of responses. However, bridging the style gap between user queries and relevant information in document text remains a persistent challenge in retrieval-augmented systems, often addressed by runtime solutions (e.g., Hypothetical Document Embeddings (HyDE)) that attempt to improve alignment but introduce extra computational overhead at query time. To address these challenges, we propose Hypothetical Prompt Embeddings (HyPE), a framework that shifts the generation of hypothetical content from query time to the indexing phase. By precomputing multiple hypothetical prompts for each data chunk and embedding the chunk in place of the prompt, HyPE transforms retrieval into a question-question matching task, bypassing the need for runtime synthetic answer generation. This approach does not introduce latency but also strengthens the alignment between queries and relevant context. Our experimental results on six common datasets show that HyPE can improve retrieval context precision by up to 42 percentage points and claim recall by up to 45 percentage points, compared to standard approaches, while remaining compatible with re-ranking, multi-vector retrieval, query decomposition, and other RAG advancements.
Ključne besede:LLM, hypothetical prompt embedding, Retrieval-Augmented Generation (RAG)
Datum objave:15.07.2025
Leto izida:2025
Št. strani:str. 129952-129961
Številčenje:Vol. 13
PID:20.500.12556/RUP-21512 Povezava se odpre v novem oknu
UDK:004.8
ISSN pri članku:2169-3536
DOI:10.1109/ACCESS.2025.3589499 Povezava se odpre v novem oknu
COBISS.SI-ID:244701955 Povezava se odpre v novem oknu
Datum objave v RUP:04.08.2025
Število ogledov:565
Število prenosov:4
Metapodatki:XML DC-XML DC-RDF
:
Kopiraj citat
  
Skupna ocena:(0 glasov)
Vaša ocena:Ocenjevanje je dovoljeno samo prijavljenim uporabnikom.
Objavi na:Bookmark and Share


Postavite miškin kazalec na naslov za izpis povzetka. Klik na naslov izpiše podrobnosti ali sproži prenos.

Gradivo je del revije

Naslov:IEEE access
Založnik:Institute of Electrical and Electronics Engineers
ISSN:2169-3536
COBISS.SI-ID:519839513 Povezava se odpre v novem oknu

Gradivo je financirano iz projekta

Financer:EC - European Commission
Številka projekta:101135012
Naslov:Application-level Swarm-based Orchestration Across the Cloud-to-Edge Continuum
Akronim:Swarmchestrate

Licence

Licenca:CC BY 4.0, Creative Commons Priznanje avtorstva 4.0 Mednarodna
Povezava:http://creativecommons.org/licenses/by/4.0/deed.sl
Opis:To je standardna licenca Creative Commons, ki daje uporabnikom največ možnosti za nadaljnjo uporabo dela, pri čemer morajo navesti avtorja.

Sekundarni jezik

Jezik:Slovenski jezik
Ključne besede:umetna inteligenca, HyPE, HyDE


Komentarji

Dodaj komentar

Za komentiranje se morate prijaviti.

Komentarji (0)
0 - 0 / 0
 
Ni komentarjev!

Nazaj
Logotipi partnerjev Univerza v Mariboru Univerza v Ljubljani Univerza na Primorskem Univerza v Novi Gorici