Evaluation
0582 | Francesco Luigi De Faveri, Guglielmo Faggioli and Nicola Ferro | Measuring Actual Privacy of Obfuscated Queries in Information Retrieval |
1336 | Kasra Hosseini, Thomas Kober, Josip Krapac, Roland Vollgraf, Weiwei Cheng and Ana Peleteiro Ramallo | Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation |
2804 | Jack McKechnie, Graham McDonald and Craig Macdonald | Context Example Selection For LLM Generated Relevance Assessments |
3868 | Maik Fröbe, Andrew Parry, Harrisen Scells, Shuai Wang, Shengyao Zhuang, Guido Zuccon, Martin Potthast and Matthias Hagen | Corpus Subsampling: Estimating the Effectiveness of Neural Retrieval Models on Large Corpora |
7064 | Pooya Khandel, Andrew Yates, Ana Lucia Varbanescu, Maarten de Rijke and Andy Pimentel | PEIR: Modeling Performance in Neural Information Retrieval |
9267 | David Otero, Javier Parapar and Alvaro Barreiro | Towards Reliable Testing for Multiple Information Retrieval System Comparisons |
Domain-specific tasks and specific user groups
9693 | Robin Ungruh, Alejandro Bellogín and Maria Soledad Pera | The Impact of Mainstream-Driven Algorithms on Recommendations For Children |
4467 | Jesus Lovon-Melgarejo, Martin Mouysset, Jo Oleiwan, Jose G Moreno, Christine Damase-Michel and Lynda Tamine | Evaluating LLM Abilities to Understand Tabular Electronic Health Records: A Comprehensive Study of Patient Data Extraction and Retrieval |
9306 | Sajad Ebrahimi, Sara Salamat, Negar Arabzadeh, Mahdi Bashari and Ebrahim Bagheri | exHarmony: Authorship and Citations for Benchmarking the Reviewer Assignment Problem |
9880 | André Rolim, Leandro Marinho, Edleno Moura, Marcos Domingues and Ricardo Oliveira | Leveraging Query Terms for Efficient Legal Document Recommendation |
1145 | Sumedh Vemuganti, Ayu Seiya and Nickvash Kani | Advancing Math Formula Search Using Diverse Structural and Symbolic Representations |
From facts and fairness to adversaries
0384 | Bo Pang, Tingrui Qiao, Caroline Walker, Chris Cunningham and Yun Sing Koh | LIBRA: Measuring Bias of Large Language Model from a Local Context |
3409 | Bjørnar Vassøy, Benjamin Kille and Helge Langseth | Opt-in Transparent Fairness for Recommender Systems |
2829 | Anton Chernyavskiy, Dmitry Ilvovsky and Preslav Nakov | Enhancing FEVER-Style Claim Fact-Checking Against Wikipedia: A Diagnostic Taxonomy and Generative Framework |
4219 | Andreea Iana, Fabian David Schmidt, Goran Glavaš and Heiko Paulheim | News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation |
8730 | Paloma Piot and Javier Parapar | Towards Efficient and Explainable Hate Speech Detection via Model Distillation |
9381 | Antonio Ferrara, Angela Di Fazio, Alberto Carlo Maria Mancino, Tommaso Di Noia and Eugenio Di Sciascio | Enhancing Utility in Differentially Private Recommendation Data Release via Exponential Mechanism |
Graphs & RAG
1527 | Vishwajeet Kumar, Jaydeep Sen, Bhawna Chelani and Soumen Chakrabarti | Graph Representation of Tables+Text and Compact Subgraph Retrieval for QA Tasks |
1745 | Giuseppe Pirrò | Higher Order Knowledge Graph Embeddings |
4152 | Imed Keraghel and Mohamed Nadif | Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering |
7124 | Jorge Gabín and Javier Parapar | Leveraging Retrieval-Augmented Generation for Keyphrase Synonym Suggestion |
0558 | Fangzheng Tian, Debasis Ganguly and Craig Macdonald | Is Relevance ‘Lost in Transmission’ from Retriever to Generator? |
Recommenders
1942 | Yuanna Liu, Ming Li, Mohammad Aliannejadi and Maarten de Rijke | Repeat-bias-aware Optimization of Beyond-accuracy Metrics for Next Basket Recommendation |
9668 | Aleksandr V. Petrov, Efi Karra Taniskidou and Sean Murphy | CountNet: Utilising Repetition Counts in Sequential Recommendation |
4072 | Simone Borg Bruun, Maria Maistro and Christina Lioma | Feature Attribution Explanations of Session-based Recommendations |
0487 | Armin Moradi, Nicola Neophytou, Florian Carichon and Golnoosh Farnadi | Embedding Cultural Diversity in Prototype-based Recommender Systems |
7820 | Roan Schellingerhout, Francesco Barile and Nava Tintarev | Town Mice versus Country Mice: Urban Bias in Job Recommender Systems |
6227 | Keigo Sakurai, Ren Togo, Takahiro Ogawa and Miki Haseyama | LLM is Knowledge Graph Reasoner: LLM’s Intuition-aware Knowledge Graph Reasoning for Cold-start Sequential Recommendation |
Conversational and Robust IR
3581 | Lili Lu, Chuan Meng, Federico Ravenda, Mohammad Aliannejadi and Fabio Crestani | Zero-Shot and Efficient Clarification Need Prediction in Conversational Search |
1879 | Zahra Abbasiantaeb, Chuan Meng, Leif Azzopardi and Mohammad Aliannejadi | Improving the Re-Usability of Conversational Search Test Collections |
3486 | Pengjie Ren, Ruiqi Li, Zhaochun Ren, Zhumin Chen, Maarten de Rijke and Yangjun Zhang | Malevolence Attacks Against Pretrained Dialogue Models |
7085 | Orion Weller, Benjamin Chang, Eugene Yang, Mahsa Yarmohammadi, Sam Barham, Sean MacAvaney, Arman Cohan, Luca Soldaini, Benjamin Van Durme and Dawn Lawrie | mFollowIR: a Multilingual Benchmark for Instruction Following in Information Retrieval |
5515 | Guglielmo Faggioli, Nicola Ferro, Raffaele Perego and Nicola Tonellotto | Query Performance Prediction using Dimension Importance Estimators |
8788 | Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Changjiang Zhou, Maarten de Rijke and Xueqi Cheng | On the Robustness of Generative Information Retrieval Models: An Out-of-Distribution Perspective |
About rankers and rerankers
2008 | Mandeep Rathee, Sean MacAvaney and Avishek Anand | Guiding Retrieval using Large Language Models |
3909 | Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast and Matthias Hagen | Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders |
6131 | Xinyu Zhang, Sebastian Hofstätter, Patrick Lewis, Raphael Tang and Jimmy Lin | Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models |
5449 | Shuoqi Sun, Shengyao Zhuang, Shuai Wang and Guido Zuccon | An Investigation of Prompt Variations for Zero-shot LLM-based Rankers |
7689 | Marwa Essam and Tamer Elsayed | Can Large Language Models Effectively Rerank News Articles for Background Linking? |
0587 | Manish Chandra, Debasis Ganguly and Iadh Ounis | One size doesn’t fit all: Predicting the Number of Examples for In-Context Learning |
Across modalities and languages
3699 | Zixuan Yi and Iadh Ounis | A Multi-modal Recipe for Improved Multi-domain Recommendation |
3703 | Nicola Messina, Lucia Vadicamo, Leo Maltese and Claudio Gennaro | Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline |
3953 | Sushil Awale, Eric Müller-Budack and Ralph Ewerth | Patent Figure Classification using Large Vision-language Models |
5307 | Wanqing Cui, Rui Cheng, Jiafeng Guo and Xueqi Cheng | MVAM: Multi-View Attention Method for Fine-grained Image-Text Matching |
4384 | Giacomo Pacini, Fabio Carrara, Nicola Messina, Nicola Tonellotto, Giuseppe Amato and Fabrizio Falchi | Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval |
8772 | Sogol Haghighat, Tim Daniel Metzler, Santosh Thoduka and Sebastian Houben | Visual Latent Captioning – Towards Verbalizing Vision Transformer Encoders |
Efficiency in IR and NLP
1055 | Eugene Yang, Nicola Tonellotto, Dawn Lawrie, Sean MacAvaney, James Mayfield, Douglas Oard and Scott Miller | MURR: Model Updating with Regularized Replay for Searching a Document Stream |
1096 | Shanxiu He, Mutasem Al-Darabsah, Suraj Nair, Jonathan May, Tarun Agarwal, Tao Yang and Choon Hui Teo | Token Pruning Optimization for Efficient Dense Retrieval with Multi-Vector Representations |
8069 | Ghazaleh Haratinezhad Torbati, Anna Tigunova, Gerhard Weikum and Andrew Yates | CUP: a Framework for Resource-Efficient Review-Based Recommenders |
2593 | Yingrui Yang, Parker Carlson, Yifan Qiao, Wentai Xie, Shanxiu He and Tao Yang | LSTM-based Selective Dense Text Retrieval Guided by Sparse Lexical Retrieval |
9274 | Aymen Berriche, Mehdi Zakaria Adjal and Riyadh Baghdadi | Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval |
3695 | Fatos Torba, Christophe Gravier, Charlotte Laclau, Abderrahmen Kammoun and Julien Subercaze | Decoding the Hierarchy: A Hybrid Approach to Hierarchical Multi-Label Text Classification |
Findings
3147 | Sonali Singh, Sachin Farfade and Prakash Mandayam Comar | Evaluating Auto-complete Ranking for Diversity and Relevance |
3324 | Ariane Mueller and Craig Macdonald | Semantically Proportioned nDCG for Explaining ColBERT’s Learning Process |
4173 | John Paul Vargheese, Marianne Wilson, Katherine Stephen, Rachel Salzano and David Brazier | Exploring the relationship between listener receptivity and source of music recommendations |
5672 | Karn N Watcharasupat, Yiwei Ding, Aleksandra T Ma, Pavan Seshadri and Alexander Lerch | Uncertainty Estimation in the Real World: A study on Music Emotion Recognition |
9368 | Wolfgang Gritz, Anett Hoppe and Ralph Ewerth | Unraveling the Impact of Visual Complexity on Search as Learning |
6199 | Fausto German, Brian Keith, Mauricio Matus, Diego Urrutia and Claudio Meneses | Semi-supervised image-based narrative extraction: A case study with historical photographic records |
1329 | Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell and Jimmy Lin | FrameworkX: A Reusable RAG Framework and Baselines for TrackY |
2119 | Jan Hutter, David Rau, Maarten Marx and Jaap Kamps | Lost but Not Only in the Middle: Positional Bias in Retrieval Augmented Generation |
4076 | Anastasiia Klimashevskaia, Snorre Alvsvåg, Christoph Trattner, Alain D. Starke, Astrid Tessem and Dietmar Jannach | Evaluating Sequential Recommendations in the Wild: A Case Study on Offline Accuracy, Click Rates, and Consumption |
2590 | Héctor López Hidalgo, Michel Boeglin, David Kahn, Josiane Mothe, Diego Ortiz and David Panzoli | Biased PromptORE: Enhancing Relation Extraction in Gendered Languages and Complex Texts – The Case of Spanish Documents from the XVI Century |
3995 | Gijs Hendriksen, Djoerd Hiemstra and Arjen de Vries | Efficient Session Retrieval Using Topical Index Shards |