Interpretation of the KDD 2024 Best Student Paper from the University of Science and Technology of China and Huawei Noah's Ark Lab: DR4SR, a New Paradigm for Sequential Recommendation


王林

Sep 02, 2024 pm 03:07 PM


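The AIxiv column is where this site publishes academic and technical content. Over the past few years, the column has received and covered more than 2,000 pieces from top laboratories at universities and companies around the world, effectively promoting academic exchange and dissemination. If you have excellent work to share, feel free to submit it or contact us for coverage. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com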

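This work was carried out by Professor Chen Enhong's team at the State Key Laboratory of Cognitive Intelligence together with Huawei Noah's Ark Lab. Professor Chen's team works in data mining and machine learning, has published many papers in top journals and conferences, and has more than 20,000 citations on Google Scholar. Noah's Ark Lab is Huawei's laboratory for basic research in artificial intelligence. It gives equal weight to theoretical research and applied innovation, and is committed to advancing technical innovation and development in the field.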

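At the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024), held in Barcelona, Spain, from August 25 to 29, the paper "Dataset Regeneration for Sequential Recommendation", published jointly by Professor Chen Enhong (IEEE Fellow) of the State Key Laboratory of Cognitive Intelligence at the University of Science and Technology of China and Huawei Noah's Ark Lab, won the sole Best Student Paper Award of the 2024 Research Track. The first author is Yin Mingjia, a PhD student at the State Key Laboratory of Cognitive Intelligence at USTC, co-supervised by Professor Chen Enhong, Professor Lian Defu, and Distinguished Associate Researcher Wang Hao. Huawei Noah's Ark Lab researchers Liu Yong and Guo Wei also took part in the work. This is the second time a student from Professor Chen's team has won the award since KDD established it in 2004.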

Paper link: https://arxiv.org/abs/2405.17795
Code link: https://github.com/USTC-StarTeam/DR4SR

Research motivation

Sequential recommenders (SR) are an important part of modern recommender systems because they aim to capture users' changing preferences. In recent years, researchers have put considerable effort into improving sequential recommenders. These methods usually follow a model-centric paradigm: develop an effective model on a fixed dataset. However, this approach often overlooks potential quality issues and flaws in the data. To address them, researchers have proposed a data-centric paradigm, which focuses on generating high-quality datasets while the model stays fixed. The team frames this as the "dataset regeneration" problem.

To obtain the best training data, the team's key idea is to learn a new dataset that explicitly contains item transition patterns. Specifically, they divide the modeling process of a recommender into two stages: extracting transition patterns $\mathcal{T}$ from the original dataset $\mathcal{X}$, and learning user preferences $\mathcal{P}$ based on $\mathcal{T}$. This is challenging because learning a mapping $\mathcal{X} \to \mathcal{P}$ involves two implicit mappings, $\mathcal{X} \to \mathcal{T} \to \mathcal{P}$. To this end, the team explores developing a dataset $\mathcal{X}'$ that explicitly represents the item transition patterns in $\mathcal{T}$, which makes it possible to separate learning explicitly into two stages, $\mathcal{X} \to \mathcal{X}' \to \mathcal{P}$, where $\mathcal{X}' \to \mathcal{P}$ is relatively easy to learn. Their main focus is therefore on learning an effective mapping function for $\mathcal{X} \to \mathcal{X}'$, which is a one-to-many mapping. The team calls this learning process the dataset regeneration paradigm, shown in Figure 1, where "regeneration" means that no additional information is introduced and only the original dataset is used.

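Figure 1: The dataset regeneration paradigm.

To realize this data-centric paradigm, DR4SR (Dataset Regeneration for Sequential Recommendation) aims to regenerate the original dataset into an informative and generalizable one. Specifically, the team first constructs a pre-training task that makes dataset regeneration possible. Next, they propose a diversity-promoted regenerator to model the one-to-many relationship between sequences and patterns during regeneration. Finally, they propose a hybrid inference strategy that balances exploration and exploitation when generating the new dataset.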

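The dataset regeneration process is general, but it may not fully suit a particular target model. To address this, the team proposes DR4SR+, a model-aware regeneration process that tailors the dataset to the characteristics of the target model. DR4SR+ personalizes scores for the patterns in the regenerated dataset and optimizes them through a bi-level optimization problem and implicit differentiation, which improves the effectiveness of the dataset.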

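Research method

In this work, the team proposes a data-centric framework called "Data Regeneration for Sequential Recommendation" (DR4SR), which, as shown in Figure 2, aims to regenerate the original dataset into an informative and generalizable one. Because the regeneration process is independent of the target model, the regenerated dataset does not necessarily meet the target model's needs. The team therefore extends DR4SR to a model-aware version, DR4SR+, which tailors the regenerated dataset to a specific target model.

Model-agnostic dataset regeneration

Figure 2

A dataset regenerator facilitates the automatic regeneration of the dataset. However, the original dataset lacks the supervision needed to train such a regenerator, so it has to be learned in a self-supervised manner. To this end, the team introduces a pre-training task that guides the learning of the diversity-promoted regenerator. After pre-training, the team uses a hybrid inference strategy to regenerate a new dataset.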


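Constructing the dataset regeneration pre-training task:

Figure 3

The regenerator should then be able to regenerate each sequence into its corresponding patterns. Together, these sequence and pattern pairs form the full pre-training dataset.

Diversity-promoted regenerator:

With the pre-training task in place, the team can now pre-train the dataset regenerator. The paper adopts the Transformer as the main architecture of the regenerator, since its generative ability has been widely verified. The dataset regenerator consists of three modules: an encoder that obtains sequence representations from the original dataset, a decoder that regenerates patterns, and a diversity-promotion module that captures the one-to-many mapping. These modules are introduced in turn below.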

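The encoder consists of multiple stacked multi-head self-attention (MHSA) and feed-forward network (FFN) layers. The decoder takes the patterns in the dataset X' as input. Its goal is to reconstruct a pattern given the sequence representation produced by the encoder. However, multiple patterns can be extracted from a single sequence, which can cause difficulties during training. To address this one-to-many mapping problem, the team further proposes a diversity-promotion module.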


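Specifically, the team adaptively modulates the influence of the original sequence by incorporating information about the target pattern into the decoding stage. First, the memory produced by the encoder is projected into K different vector spaces. Ideally, different target patterns should match different memories. To this end, a Transformer encoder is also introduced to encode the target pattern, and the resulting pattern representation is compressed into a probability vector whose k-th entry is the probability of selecting the k-th memory. To make sure every memory space is fully trained, no hard selection is performed. Instead, the final memory is obtained as a weighted sum of the K memories.

Finally, the resulting memory is used to drive the decoding process, which effectively captures the complex one-to-many relationship between sequences and patterns.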
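The soft selection among K memories described above can be illustrated with a small sketch. This is not the authors' implementation: the single-vector memory, the linear projections, the linear gate, and every name (`mix_memories`, `projections`, `gate`) are assumptions made only for illustration. In the actual model the memory is produced by a Transformer encoder and is presumably a sequence of vectors.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a length-cols vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def mix_memories(memory, pattern_repr, projections, gate):
    """Soft-select among K projected copies of the encoder memory.

    memory       : encoder output for the sequence, length d (assumed shape)
    pattern_repr : encoding of the target pattern, length d (assumed shape)
    projections  : K hypothetical d x d matrices, one per memory space
    gate         : hypothetical K x d matrix mapping pattern_repr to K logits
    """
    memories = [matvec(proj, memory) for proj in projections]  # K candidate memories
    probs = softmax(matvec(gate, pattern_repr))                # probability of each memory
    d = len(memory)
    # Weighted sum rather than hard selection, so that all K spaces contribute.
    final = [sum(p * m[i] for p, m in zip(probs, memories)) for i in range(d)]
    return final, probs

rng = random.Random(0)
d, K = 4, 3

def rand_matrix(rows, cols):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

memory = [rng.uniform(-1, 1) for _ in range(d)]
pattern_repr = [rng.uniform(-1, 1) for _ in range(d)]
projections = [rand_matrix(d, d) for _ in range(K)]
gate = rand_matrix(K, d)

final_memory, probs = mix_memories(memory, pattern_repr, projections, gate)
print(probs, final_memory)
```

Because the selection probabilities depend on the target pattern, two different patterns extracted from the same sequence can steer the decoder toward different memories, which is one way to read the one-to-many behaviour the article describes.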

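Model-aware dataset regeneration

Because the preceding regeneration process is agnostic to the target model, the regenerated dataset may not suit a specific target model. The team therefore extends the model-agnostic regeneration process to a model-aware one. To this end, on top of the dataset regenerator, they introduce a dataset personalizer that scores each data sample in the regenerated dataset. The team then optimizes the dataset personalizer efficiently through implicit differentiation.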
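For readers unfamiliar with bi-level optimization, the generic textbook form is sketched below. The notation is assumed here and is not taken from the paper: $\phi$ stands for the personalizer parameters, $\theta$ for the target model parameters, $\mathcal{L}_{\text{train}}$ for the score-weighted training loss, and $\mathcal{L}_{\text{val}}$ for a hypothetical upper-level objective that does not depend on $\phi$ directly. The second line is the standard implicit-function-theorem hypergradient under those assumptions.

```latex
\min_{\phi}\; \mathcal{L}_{\text{val}}\bigl(\theta^{*}(\phi)\bigr)
\quad \text{s.t.} \quad
\theta^{*}(\phi) = \arg\min_{\theta}\; \mathcal{L}_{\text{train}}(\theta, \phi)

\frac{d \mathcal{L}_{\text{val}}}{d \phi}
= - \frac{\partial \mathcal{L}_{\text{val}}}{\partial \theta}
\left( \frac{\partial^{2} \mathcal{L}_{\text{train}}}{\partial \theta \, \partial \theta^{\top}} \right)^{-1}
\frac{\partial^{2} \mathcal{L}_{\text{train}}}{\partial \theta \, \partial \phi^{\top}}
\Bigg|_{\theta = \theta^{*}(\phi)}
```

Implicit differentiation is attractive here because it avoids backpropagating through every inner training step. How the inverse Hessian is approximated in DR4SR+ is not stated in this article.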

Dataset Personalizer:

The team's goal is to train a parameterized dataset personalizer, implemented as an MLP, that assigns each data sample a score W for the target model. To keep the framework general, the computed scores are used only to adjust the weights of the training loss, which requires no further modification of the target model. The starting point is the original next-item prediction loss. The training loss on the personalized dataset is then defined as that loss with each sample's term weighted by its personalizer score.
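A minimal sketch of score-weighted training follows. It assumes the next-item prediction loss is a softmax cross-entropy over candidate items and that the scores are already given; the paper's exact loss and normalization may differ, and the function names `next_item_loss` and `personalized_loss` are hypothetical.

```python
import math

def next_item_loss(logits, target):
    """Cross-entropy of the target item under softmax(logits)."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return log_norm - logits[target]

def personalized_loss(batch_logits, targets, scores):
    """Mean of per-sample losses, each scaled by its personalizer score."""
    losses = [next_item_loss(l, t) for l, t in zip(batch_logits, targets)]
    return sum(w * loss for w, loss in zip(scores, losses)) / len(losses)

# Two toy samples over a catalogue of three items (made-up numbers).
batch_logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 3.0]]
targets = [0, 2]

uniform = personalized_loss(batch_logits, targets, [1.0, 1.0])     # plain average
reweighted = personalized_loss(batch_logits, targets, [1.0, 0.0])  # second sample ignored
print(uniform, reweighted)
```

Since only the per-sample weights change, the target model's forward pass and architecture are untouched, which matches the generality argument made in the text.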

Experimental conclusions

Main experiment

To validate the proposed framework, the team compared the performance of each target model with its "DR4SR" and "DR4SR+" variants.

Figure 4: Overall performance.

From the overall performance in Figure 4, the following conclusions can be drawn:

DR4SR can regenerate an informative and generally applicable dataset

Different target models prefer different datasets

Denoising is only one part of the dataset regeneration problem
