Distributionally Robust Unsupervised Dense Retrieval Training on Web Graphs

0upvotes

By: Peixuan Han, Zhenghao Liu, Zhiyuan Liu, Chenyan Xiong

This paper introduces Web-DRO, an unsupervised dense retrieval model, which clusters documents based on web structures and reweights the groups during contrastive training. Specifically, we first leverage web graph links and contrastively train an embedding model for clustering anchor-document pairs. Then we use Group Distributional Robust Optimization to reweight different clusters of anchor-document pairs, which guides the model to assign... more

Information RetrievalOctober 26, 2023 6:37am

Comments (0)
Views (208)

Large Search Model: Redefining Search Stack in the Era of LLMs

0upvotes

By: Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others. These components are often optimized and deployed independently. In this paper, we introduce a novel conceptual framework called large search model, which redefines the conventional search stack by unifying search tasks with one large language model (LLM). All tasks are form... more

Information RetrievalOctober 24, 2023 3:03pm

Comments (0)
Views (189)

Budgeted Embedding Table For Recommender Systems

0upvotes

By: Yunke Qu, Tong Chen, Quoc Viet Hung Nguyen, Hongzhi Yin

At the heart of contemporary recommender systems (RSs) are latent factor models that provide quality recommendation experience to users. These models use embedding vectors, which are typically of a uniform and fixed size, to represent users and items. As the number of users and items continues to grow, this design becomes inefficient and hard to scale. Recent lightweight embedding methods have enabled different users and items to have diver... more

Information RetrievalOctober 24, 2023 1:04pm

Comments (0)
Views (190)

Unified Pretraining for Recommendation via Task Hypergraphs

0upvotes

By: Mingdai Yang, Zhiwei Liu, Liangwei Yang, Xiaolong Liu, Chen Wang, Hao Peng, Philip S. Yu

Although pretraining has garnered significant attention and popularity in recent years, its application in graph-based recommender systems is relatively limited. It is challenging to exploit prior knowledge by pretraining in widely used ID-dependent datasets. On one hand, user-item interaction history in one dataset can hardly be transferred to other datasets through pretraining, where IDs are different. On the other hand, pretraining and f... more

Information RetrievalOctober 23, 2023 10:23am

Comments (0)
Views (209)

Motif-Based Prompt Learning for Universal Cross-Domain Recommendation

0upvotes

By: Bowen Hao, Chaoqun Yang, Lei Guo, Junliang Yu, Hongzhi Yin

Cross-Domain Recommendation (CDR) stands as a pivotal technology addressing issues of data sparsity and cold start by transferring general knowledge from the source to the target domain. However, existing CDR models suffer limitations in adaptability across various scenarios due to their inherent complexity. To tackle this challenge, recent advancements introduce universal CDR models that leverage shared embeddings to capture general knowle... more

Information RetrievalOctober 23, 2023 10:16am

Comments (0)
Views (195)

Towards Multi-Subsession Conversational Recommendation

0upvotes

By: Yu Ji, Qi Shen, Shixuan Zhu, Hang Yu, Yiming Zhang, Chuan Cui, Zhihua Wei

Conversational recommendation systems (CRS) could acquire dynamic user preferences towards desired items through multi-round interactive dialogue. Previous CRS mainly focuses on the single conversation (subsession) that user quits after a successful recommendation, neglecting the common scenario where user has multiple conversations (multi-subsession) over a short period. Therefore, we propose a novel conversational recommendation scenario ... more

Information RetrievalOctober 23, 2023 9:55am

Comments (0)
Views (187)

Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language

0upvotes

By: Zekai Qu, Ruobing Xie, Chaojun Xiao, Yuan Yao, Zhiyuan Liu, Fengzong Lian, Zhanhui Kang, Jie Zhou

With the thriving of pre-trained language model (PLM) widely verified in various of NLP tasks, pioneer efforts attempt to explore the possible cooperation of the general textual information in PLM with the personalized behavioral information in user historical behavior sequences to enhance sequential recommendation (SR). However, despite the commonalities of input format and task goal, there are huge gaps between the behavioral and textual ... more

Information RetrievalOctober 23, 2023 8:59am

Comments (0)
Views (200)

DCRNN: A Deep Cross approach based on RNN for Partial Parameter Sharing in Multi-task Learning

0upvotes

By: Jie Zhou, Qian Yu

In recent years, DL has developed rapidly, and personalized services are exploring using DL algorithms to improve the performance of the recommendation system. For personalized services, a successful recommendation consists of two parts: attracting users to click the item and users being willing to consume the item. If both tasks need to be predicted at the same time, traditional recommendation systems generally train two independent models... more

Information RetrievalOctober 20, 2023 10:45am

Comments (0)
Views (199)

CIR at the NTCIR-17 ULTRE-2 Task

0upvotes

By: Lulu Yu, Keping Bi, Jiafeng Guo, Xueqi Cheng

The Chinese academy of sciences Information Retrieval team (CIR) has participated in the NTCIR-17 ULTRE-2 task. This paper describes our approaches and reports our results on the ULTRE-2 task. We recognize the issue of false negatives in the Baidu search data in this competition is very severe, much more severe than position bias. Hence, we adopt the Dual Learning Algorithm (DLA) to address the position bias and use it as an auxiliary model... more

Information RetrievalOctober 20, 2023 10:23am

Comments (0)
Views (207)

Simulating Users in Interactive Web Table Retrieval

0upvotes

By: Björn Engelmann, Timo Breuer, Philipp Schaer

Considering the multimodal signals of search items is beneficial for retrieval effectiveness. Especially in web table retrieval (WTR) experiments, accounting for multimodal properties of tables boosts effectiveness. However, it still remains an open question how the single modalities affect user experience in particular. Previous work analyzed WTR performance in ad-hoc retrieval benchmarks, which neglects interactive search behavior and lim... more

Information RetrievalOctober 20, 2023 10:00am