今年もRecSys の季節になりました。去年、全部（あとで）みようとおもって息切れし、結局半分くらいしか見なかったので、今年は起きている時間にしかみないことを決めました。時差に抗わない。

Session 2 Sequential Recommendation

Aspect Re-distribution for Learning Better Item Embeddings in Sequential Recommendation

Seq recommendation = next item from given item sequence
「Taylor swift 」というaspectがseq を特徴づけている → これをembeddings に反映させられないか？
まず、item embedding を 3つのaspect に分解。分解は均等に
次に重みをつけて再計算
projection matrixs
手法がgeneral なので、他のものにも応用可能

Defending Substitution-based Profile Pollution Attacks on Sequential Recommenders

seq recsys への攻撃手法と、攻撃から守るための手法
攻撃 = Data Poisoning Attacks と Profile pollutions
攻撃の結果 1. 特定のアイテムの露出を増やす 2. ぴんとこないものばかり推薦するの２パターンある
Attacks Algorithms use perturbation to item space
Defense Objective = robust なモデルに
ディリクレ近傍サンプリング : Augment input items training
Adversal training
検証：攻撃するモデルつくれた / popularなアイテムを過度にpopular にする方が簡単

Context and Attribute-Aware Sequential Recommendation via Cross-Attention

見慣れたTransformerっぽいやつ

推薦には item とuser、両方のprofile が必要
でも計算に時間かかったりして大変
そこで CARCA
実験したらよくできた
どのattribute が効くのかはジャンルによる（セッションなのか、userなのか、itemなのか）
QA
- seq の適度な長さは場合による
- hyper parameter チューニングはがんばった論文に書いた
- timestamp は月、日、曜日、時間、.... と６つの特徴量にわけてもたせた
  - （月や月初か月末なのか、曜日なのかによって、異なりそうなので、かしこい分け方だと思った）

Effective and Efficient Training for Sequential Recommendation using Recency Sampling

Seq Rec ではDeep Learning 使いがちだけど、効率すごいわるい
Recency Sampling で学習すると効率いいよ
近いものを重く、遠いものを軽く重みづけて、学習データを作成する方法

## A Systematic Review and Replicability Study of BERT4Rec for Sequential Recommendation - 前のセッションと同じ人 - BERT4Rec の再現性をひたすら検証 - 370 paper みた - 学習回数、めちゃくちゃ増やさないと論文通りのパフォーマンスにならない - Hugging face の Transformer 使うと、短い学習時間でパフォーマンス出る - 回し者だろうか - QA : Hugging face で変わったことは？　→ ひとことではいえない

Denoising Self-Attentive Sequential Recommendation

Best Paper
Seq に推薦に寄与しないノイズ的なアイテムがはいっちゃうことあるので、それを除きたい
Attention つかってると、勝手に重みづけちゃうし
なので、フィルタリングする。それが、一番上の図
NLP に再輸入することもできると思う
あとで読もう

Session 3 Domain-Specific Recommendation

ドメインSpecific なものは楽しい

Modeling Two-Way Selection Preference for Person-Job Fit

Online Recruiting では Job <-> Person が two-way selection になる
両方反映させたい
Method
- グラフ学習
- 最適アルゴリズム = quadruple-base loss / dual-perspective contrasive
グラフでは pos とneg 、両方のnode をつくり「人が断った仕事」「雇用者が不採用にした人」の関係を表現
データで node の representation を update

Learning Recommendations from User Actions in the Item-poor Insurance Domain

保険の推薦をするモデルをつくった話
保険は難しい
- Item の種類がない → session-based
- 頻繁に買わない → several types
- 電話でもうしこむので、Web のimplicit な情報がない → session-purchase
RNN で過去のsession を学習データとして使うことにより、特徴量をリッチに
session が１番効いた
発表者、緊張してた。

Reusable Self-Attention Recommender Systems in Fashion Industry Applications

-Zalando のIndustrial Session - いろんな推薦のタイプがある - Get-the-look - Inspired-by-you - You-might-also-like - Algorithmic-Fashion-companion - Influencer-profile-recommendation - 今までは複雑なので、それぞれで独立してモデルを学習していた - なので、他のページで行動していても「Cold Start Problem」 - Single Recommender Systemで、All the usecases を賄えないかな？ - Transfomer modelならそれができるんじゃない？

いつものTransormer モデルだ！
特徴量にこつを感じた（最初の写真）
でもなんでも雑にざばーんとconcat してヘテロなデータつくってもなんかなんとかなるのが DL

でてきた画像に対して、自然言語で「もっと黄色いやつ」とか入力していくことによりほしいものを手に入れるタイプの推薦
Gated Recurrent Network を利用すると良い感じになった
自然言語で推薦システムやチャットボットと会話するシーンが全然思い浮かばなかったけど、英語圏の人は自然言語で検索するときいたのでつながった
- 逆にやっぱり日本ではチャットボットいらなさそう

Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Pinterest のIndustrial Session
Pin / Boards / Feeds
User intention is the most important signals
Long term / Short term のユーザ嗜好を、それぞれ別のモデルつくって捕捉して、組み合わせて推薦する
GPU model の最適化もしたよ

Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy

Spotifyによる新しい Podcast の推薦問題
画面がみえなくてきいてなかった
左に目次を出し続けるスライドがいいな、と思ったが、本題と関係はない

Translating the Public Service Media Remit into Metrics and Algorithms

ZDF というメディアの推薦。Industrial Session
公共のサービスなので、ビジネス的に良い結果なだけじゃだめ
PSMV : diversity balance や educational なこと、coverage , novelty,serendipity 、transparency があることが必要
定性的に掲げるだけでなく、Value を図るメトリクスをつくった（これが大事）
指標により、価値を維持できることが監視できる
メトリクス、だよなあ

感想

Transformer まつり
Abstract は読んでいけ

なんとかエンジニアのきろく ... ばばあやこのtech blog

RecSys 2022 Day1

Session 2 Sequential Recommendation

Aspect Re-distribution for Learning Better Item Embeddings in Sequential Recommendation

Defending Substitution-based Profile Pollution Attacks on Sequential Recommenders

Context and Attribute-Aware Sequential Recommendation via Cross-Attention

Effective and Efficient Training for Sequential Recommendation using Recency Sampling

Denoising Self-Attentive Sequential Recommendation

Session 3 Domain-Specific Recommendation

Modeling Two-Way Selection Preference for Person-Job Fit

Learning Recommendations from User Actions in the Item-poor Insurance Domain

Reusable Self-Attention Recommender Systems in Fashion Industry Applications

Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy

Translating the Public Service Media Remit into Metrics and Algorithms

感想

Session 2 Sequential Recommendation

Aspect Re-distribution for Learning Better Item Embeddings in Sequential Recommendation

Defending Substitution-based Profile Pollution Attacks on Sequential Recommenders

Context and Attribute-Aware Sequential Recommendation via Cross-Attention

Effective and Efficient Training for Sequential Recommendation using Recency Sampling

Denoising Self-Attentive Sequential Recommendation

Session 3 Domain-Specific Recommendation

Modeling Two-Way Selection Preference for Person-Job Fit

Learning Recommendations from User Actions in the Item-poor Insurance Domain

Reusable Self-Attention Recommender Systems in Fashion Industry Applications

Multi-Modal Dialog State Tracking for Interactive Fashion Recommendation

Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy

Translating the Public Service Media Remit into Metrics and Algorithms

感想