RecSys 2019 にいってきた　2019/09/16 Day 1 - なんとかエンジニアのきろく ... ばばあやこのtech blog

RecSys 2018 の論文を読み終えることなく 2019 に行ってしまいました。社内のqiita においたメモを自分の記録としておいておきます。

プログラム → https://recsys.acm.org/recsys19/program/ f:id:ayakobaba:20191208180526j:plain

Welcome Session

f:id:ayakobaba:20190916104059j:plain

場所がら北米、ヨーロッパが多い f:id:ayakobaba:20191208180850j:plain

日本からは4/10、Accept。キーワード。やっぱりCFつよい。 f:id:ayakobaba:20191208180857j:plain

だいたいalgorithm系。はじめてUX research が2番目に f:id:ayakobaba:20191208180910j:plain

Best Paper Award : Are We Really Making Much Progress ? DL の論文の検証したら全然再現できなかった、ってやつ

Keynote : Rude Awakenings from behaviouris dreams methodological integrity and th GDPR

スピーカー：lawyer 、 CS の人に法律を教えてる人。皮肉な感じ

* 行動ターゲティングはうまくいかなくなっている
    *  moving out of RTB at youtube
* Problem of Behaviourism 
    * ResSys では Filtering is a must.  Content filtering / Collaborativ filtering /callibration
    * ただ、Behavioural search , advertising ... はOpening the pandora's box= promoting addiction
    * Recsys が  userのためになる とは限らない。基本的には "economic incentive structure"
    * GDPR change the incentive structure
    * Political economy of recsys : Mining / Inferring / Creating ( = 機械がそういったらそうなる） preferences  
*  behaviourism and micro-targeting
      * 行動データですべてを判断してしまう。内面とか個性とかも （編集済み） 
      * behaviourism パブロフ、ワトソン、チューリング...   nudge theory
      * What can be controlled matters / What mattters can be controlled
         * → 行動データを使うと、fundamentally unreliable  results に　 (コンピュータが過去のimplicit なデータに基づくので修正できない）
* GDPR
  * データの対象もすごくひろい、対象者は"natural person"。
  * Controller Is Liable. ( Publisher / Adviteser / Webshop/ Google / Facebook ) 
  * どんなデータ処理も legal ground (明瞭なInformed Consent とか)が必要
  * principle( purpose limitation , accuracy, transparency, accountability ) 
  * GDPR Pricnciple = PURPOSE ( 誰に責任があって、データを処理するにはとてつもない責任が伴うということを明確にしている)
  * Transparency in ML Hofman,Sharma , Watts (2017)  
 
（結論）GDPR はincentive structure を変えた

質問が哲学の経験主義とかになってわからなかった ... quantification / qualification の比較とか。ヨーロッパだなーと思いました。

Paper Session 1

◆ Personailized Re-ranking for Recommendation

f:id:ayakobaba:20191208180920p:plain

Alibaba の人
ランキング大事。オークションをとりあげる
ランキング、score つけて スコア順に並べる
mutal influence がむしできない（redundancy  , supplement) し、個々に違いがある
モデル : 最初のlist のitem をpersonalized vector  でrerank
personalized vector はあらかじめ学習

◆Online Ranking Combination

goal:optimize convex combination of ranking models
objective : choosing combination weights 
一般的な方法：サロゲート関数を使う 
ExpW :can't scale
SPSA : 一歩ずついくのでlocal maximum に
RSPSA : SPSA with RProp( RProp : step size is updated based on the sign of)
RFDSA+ : Switch to finite differences 
 (なんで組み合わせようとしたんだろう

◆Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation

このひともAlibaba 
目的がいっぱいある( Accuracy vs Diversity , Fairness, Popularity and Longtail)
パレート効率性は最低限みたしたい！
遺伝的アルゴリズム ？ Scalarizaton technique
ベースは Scalarization 、Gradient Descent 
全ての損失関数に対してよいdirection を探す

◆From Preference into Decision Making: Modeling User Interactions in Recommender Systems

f:id:ayakobaba:20191208181650p:plain

Bloomberg のひと
inaction からも学習するよ
同じページで見たけど虫したやつを反映させたい / このページを3回みたいとしたら
youtube みて2つきにいった、１こ見て戻ったらもうひとつがきえてる
Decision Field Theory attention contrast valence prefrence
Page-Level RNN Model
SVD SVD++ RNN と比較したら MAP@8 でこれらを上回るよいけっか
Independent binary , Competitive , Relative , Negative sampling , Displayed but inaction
inaction をいれるならpairwise ranking がrobust

◆Deep Generative Ranking for Personalized Recommendation

北京大学のひと
DN つかいたいけど、high bias / sparse では難しい / rank 学習の機構をもたない
Deep Generative Ranking
Pointwise feedback data generation : Beta-Bernoulli non linear latent factor model <-Wassetstein auto-encoder /Beta-ベルヌーイはスパースに適切
Pairwise ranking list creation : Deep Bayesian personalized ranking model
上の２つを組み合わせたよ
( Generative 感がわからなかった implict feedback data をgenerate したらしいが

◆Recommending What Video to Watch Next: A Multitask Ranking System

f:id:ayakobaba:20191208180926p:plain f:id:ayakobaba:20191208180933p:plain

google のひと
人の好み = user utility は多様だし、いろんなサイン（like / click / response ）がある→ここからみつけるのたいへん
ranking part のことを話す
youtube Up next
    different and sometimes conflicting objectives
   selection biases in the system
Multi-gate Mixture-of-Expert : Representaion Learning → Task Specific Performance 
Adding Shallow Tower to Model SelectionBias : Factorize selection bias to relevance 
Tradeoff between Effectiveness and Efficiency
( プレゼンがとても笑いとっててグーグルさんすげー

Industry Session 1

◆Using AI to Build Communities Around Interests on LinkedIn

Mission:共通の興味とアクティブな会話のあるコミュニティをつくること
Hashtag, Company, Group, Region, Skill -> Interests
Discover → Engage → Contribute
Beyond Response Prediction
→ Downstream Utilities
→ Creator Objective Formulation
  Incentivizing Creators through Follows Recommendations クリエイターにもっとつくってもらうためにはFeedbackを増やすこと

◆The Trinity of Luxury Fashion Recommendations : Farfetch

f:id:ayakobaba:20191208180938p:plain

ハイファッションのECサイト。おしゃれおねえさんData Scientist
特異：変化が早く、芸術を買うみたいに買う: unique、emotion、高価
Fashion Experts / Data / Experimentation
Data  :  すべてを経験することができない : Why をつうりだせない
Fashion meets Data : Fashion Experts → Data Model 
デザイナーの重要性：people consume fashion the same way they consume music .
89% search query including designer 82 designers only
How to boost the designer? <写真> Cost effective solution
実験実験 →　同じデザイナーの服ばかりレコメンド → +30%
教訓　"Collaboration is key !" ：ドメインエキスパートと一緒に / 分析を自動化しろ / チームと同期しろ

◆Pandora : Just Play Something Awesome: The Personalization Powering Voice Interactions at

Pandora
声UI のRecSys
Known-item queries / Thematic queries / Open-ended queries
知ってるあれを検索
  聴きたい曲がはっきりしてる。聞き取れないとき 、Robust Retrieval/ 履歴よりその人が検索しそうなもの
  UX Consideration → 聞き返す！
今のシチュエーションにあったあれを聞かせて
  relaxing driving or active driving ？ :
     Intent prediction/ Slots filling →　Content tagging / Shared taxonomy Tag based search →　User Item affinity / Query-Result relevance
     State Tracker
  Broad Queries 「やばいやつかけて」
     Bandit Solution 
  評価方法が全然違う / Implicit feedback / Selection bias

◆Bosch : Future of In-Vehicle Recommendation Systems

f:id:ayakobaba:20191208180943p:plain

車のRecSys
Personalized / Automated / Connected / Electrified
POI / Fueling / SNS ,   Music のInfotainment , Navigation , Vehicle control (シートベルトとか）
他と違うところ：
 Context Aware Recommendation : Driver and Occupant / Vehicle Context
Connected Recommendation : Location based / Connected buildings / Location Based Social Network
Human Machine Interface : 運転中だからできることが限られる
Privacy & Security : どこにいくかとか究極の個人情報
自動運転：どこまで自動にするか？？
Our vision : understands me before ,during and after my trip
例：convenience charging 
1. access management 2 range projection 3 route planning 4 location-based services
Architecture:写真
QA: この推薦システムって必要あるの？ (... ひどい)

◆IKEA : Designer-Driven Add-to-Cart Algorithms

f:id:ayakobaba:20191208180950p:plain

IKEA の読み方はアイキア
Designer-driven recommendations
なぜこれをつくろうとしたか？　：店と同じ体験をオンラインで
What is style ?  :部屋全体のコーディネート
どうやって？　<写真>→似たテイストのものをおすすめ
商品のイメージ　をCNN  → Dense → Distance → Sigmoid 
違う色・素材・シリースのものは一緒に購入されない
 (embedded space をみせてくれた、おもしろかった）
What is gram matrix ? 商品のイメージ　をCNN  → Gram matrices ー　Dense → Distance → Sigmoid  ってすると「色やスタイルが一緒のものがいっしょに」
( 超早口 ）
QA : 学習データはリアル店舗のデータしかつかってないよ

Industry Session は話す人が慣れてて聴きやすい。ねたも面白いあとどうしてもインド人の英語が聞き取れない

Paper Session 2

◆Users in the Loop: A Psychologically-Informed Approach to Similar Item Retrieval :star:

truefit.com f:id:ayakobaba:20191208180956p:plain

item-item similarity は心理学的ものを虫しがち　→ これをとりいれたい
How do humans judge similarity?  ジェーン はチータのように走る Xチータはジェーンのように走る  つまり非対称
Tversky similarity  (vs Jacard similarity)
目的：どうやってつくる？　
　　forced -choice desings for human choice → どっちの方が似てる？ でtraining data
評価：Tversky similarity のパラメータを学習　
　　いくつかのルールをビジネスからきめて評価する
　　Fashion でやったよ
結論 : できるよ
( Deep じゃなくておもしろかった
QA : 特徴はどうやってつくったの→人海戦術。超大変なので自動でつけるのがFuture Work

◆Explaining and Exploring Job Recommendations: a User-driven Approach for Interacting with Knowledge-based Job Recommender Systems

説明もの「なぜこのjob を推薦したか？」
Interactive & Job Recommender Systems
Labor Market  、UCD Process 、Focus Groups 、Visualization 超工夫した
Exploration/Control 、Explanations、Actionable Insights 
アンケートで評価　→ いい感じ
( 会社の場所が地図ででるUI、いい
( ドメイン、kuleuven.be とは = ベルギーの大学
QA ：雇用が増えたとかそこまでみた？ってchairman からの質問

◆Designing for the Better by Taking Users into Account: A Qualitative Evaluation of User Control Mechanisms in (News) Recommender Systems

Curse or Blessing? 
情報は民主化にcrucial
Fair News Project 
 ethnographic research
 simulation model
 news reader research <-これ
特にUser Control に注目
Recsys のプロトタイプをつくって、Qualitative research
自分の好みとか入れられる。
リーディング履歴 効果的、ニュースのカテゴリを見せるのも自覚できるので
divergence
順番を変えるだけでは不十分
自分でコントロールすると、個人の目的も達成しやすくなるし、活性化する
QA: ユーザコントロールが多様性を導くと思えない→ ある面では。でもフェアで透明性が保たれてexplainableになることは、フィルターバブルされてることを認識して多様性を求めることにつながると思う

◆Efficient Privacy-Preserving Recommendations based on Social Graphs

この辺からスマホでメモとってるのでうすい f:id:ayakobaba:20191208181002p:plain f:id:ayakobaba:20191208181008p:plain

Association rule mining 
Privacy 興味で人がわかっちゃう
distributed privacy preserving ARM
ユーザー サンプリング
homophily インフルエンサー
グラフ理論でランダムウォーク 
プライバシーとは？
anonymous user sampling
(苦手なグラフ理論でわからん/というか、課題がわからん

◆PrivateJobMatch: A Privacy-Oriented Deferred Multi-Match Recommender System for Stable Employment

Job recommender system
Both side recommend candidate job description
DAA 1to1 only
Mixed mmdaa mmda で空いたところをlmf mmdaa で埋める
Minimize vacancy！
privatejobmatch github

◆User-Centered Evaluation of Strategies for Recommending Sequences of Points of Interest to Groups

ttdp をどう解くか？ <- ttdp ってなんだろう、最適化問題？
  how to generate group
  グループ推薦の戦略
  旅行会社のグループツアーで人数が多くていきたいところがばらけている場合、
  temporary でグループにわけて別々のルート行ってもらった方が満足度あがるよね
  という問題にチャレンジ
相互推薦システムを、つくって、グラフで解く
検証 ：3つのコースを作成し、アンケートで定性評価（旅の途中でいったん別れるのもいいか？/推薦されたコースの評価)
だけどいろいろある 
  食事は一緒に食べたい、とか。
  小さい子どもがいるグループは別れたくない、とか。
  そーいう自明な要求をどう組み込んでどうコースとグループを組み立てるか？