Passage Segmentation of Documents for Extractive Question Answering | Zuhong Liu, Fabien Caspani and Charles-Elie Simon |
DiffGR: A Discrete Diffusion-Based Model for Personalised Recommendation by Reconstructing User-Item Bipartite Graphs | Zheng Ju, Honghui Du, Elias Tragos, Neil Hurley and Aonghus Lawlor |
Hierarchical Skip Decoding for Efficient Autoregressive Language Model | Yunqi Zhu, Xuebing Yang, Yuanyuan Wu and Wensheng Zhang |
EGL-DST: Error-Guided Learning for Multidimensional Evaluation Method of Dialogue State Tracking via GPT-4 | Wenjie Dong, Sirong Chen, Ming Gu and Yan Yang |
Examining the Impact of Transcript Accuracy on Podcast Search and Re-Ranking | Watheq Mansour, Shane Culpepper and Joel Mackenzie |
Patience in Proximity: A Simple Early Termination Strategy for HNSW Graph Traversal in Approximate k-Nearest Neighbor Search | Tommaso Teofili and Jimmy Lin |
BAAF – A Framework for Media Bias Detection | Soumyadeep Sar, Subinay Adhikary and Dwaipayan Roy |
Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions | Sebastian Heineking, Jonas Probst, Daniel Steinbach, Martin Potthast and Harrisen Scells |
Investigating the Scalability of Approximate Sparse Retrieval Algorithms to Massive Datasets | Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini and Leonardo Venuta |
Efficient Constant-Space Multi-Vector Retrieval | Sean MacAvaney, Antonio Mallia and Nicola Tonellotto |
Entity-Aware Cross-Modal Pretraining for Knowledge-based Visual Question Answering | Omar Adjali, Olivier Ferret, Sahar Ghannay and Hervé Le Borgne |
SAFERec: Self-Attention and Frequency Enriched Model for Next Basket Recommendation | Oleg Lashinin, Denis Krasilnikov, Aleksandr Milogradskii and Marina Ananyeva |
A New Dataset for Keyword Extraction from IT Job Descriptions | Nisan Fichman, Hadar Isaacson and Natalia Vanetik |
A Test Collection for Dataset Retrieval | Nikolay Kolyada, Martin Potthast and Benno Stein |
Iterative Self-Training for Code Generation via Reinforced Re-Ranking | Nikita Sorokin, Ivan Sedykh and Valentin Malykh |
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Mohammad Mahdi Abootorabi and Ehsaneddin Asgari |
Retrieval-Augmented Neural Team Formation | Mohammad Dara, Radin Hamidi Rad, Fattane Zarrinkalam and Ebrahim Bagheri |
Improving RAG for Personalization with Author Features and Contrastive Examples | Mert Yazan, Frederik Bungaran Ishak Situmeang and Suzan Verberne |
Improving Language Model Performance by Training on Prototypical Contradictions | Maren Pielka, Marie-Christin Freischlad, Svetlana Schmidt and Rafet Sifa |
Sequence-to-Sequence Encoder-Decoder Models for Efficient Listwise Reranking | Manveer Singh Tamber, Ronak Pradeep and Jimmy Lin |
Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval | Luo Ji, Feixiang Guo, Teng Chen, Qingqing Gu, Xiaoyu Wang, Ningyuan Xi, Yihong Wang, Peng Yu, Yue Zhao, Hongyang Lei, Zhonglin Jiang and Yong Chen |
Can Generative AI Adequately Protect Queries? Analyzing the Trade-off Between Privacy Awareness and Retrieval Effectiveness | Luca Herranz-Celotti, Blessing Guembe, Giovanni Livraga and Marco Viviani |
Approximate Bag-of-Words Top-k Corpus Graphs | Lachlan Dunn, Luke Gallagher and Joel Mackenzie |
A Simple but Effective Closed-form Solution for Extreme Multi-label Learning | Kazuma Onishi and Katsuhiko Hayashi |
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring | Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup and Keshav Santhanam |
Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? | Ipek Baris Schlicht, Zhixue Zhao, Burcu Sayin, Lucie Flek and Paolo Rosso |