見出し画像

【速報】元Twitter社員がAI、DMへのアクセスの詳細について告白独占情報（全日本語訳）

2022年12月14日 00:54

つい先日Twitterのタイムラインで流れてきたツイート。

BREAKING: Former Twitter employee shares exclusive details with me on AI, Access to DMs, and more.

Thread below ⬇️
— Kristen Ruby (@sparklingruby) December 9, 2022

クリスティン・ルビー氏（アメリカのソーシャルメディア・アナリスト、TV司会者、政治コメンテーター）によって元Twitter社員に独占取材した内容を、スレッド化してツイートし、公開されました。

以下、ツイートの内容を日本語に全訳したものを掲載します。
（日本語訳 : 添削・推敲藤田二郎/ITER）

注）クリステン・ルビー氏のインタビューに答えた元Twitter社員の返信は画像として掲載されおりました。その画像をMicrosoft Office LensのOCRでテキスト化したものを掲載しております。元文章がタイポ含みのものでしたので、英単語として分かりにくいものがある点はご了承ください。

BREAKING: Former Twitter employee shares exclusive details with me on AI, Access to DMs, and more. Thread below

Kristen Ruby @sparklingruby : Twitter

速報: ツイッター社の元従業員がAIやDMへのアクセスなど多くの詳細について独占的にシェアしてくれました。スレッド続く。

1. What is guano?

So every database serves a purpose. Tracking actions or tweets etc etc. we had probably 100s if not 1,000s of databases each with several tables. Ok. So the raw data existed in a guano table in each (most?) databases. They were called guano because they had 'dirty' non-formatted data. You could also make notes in them. If a tweet was 'actuoned' meaning we marked it in some way it would have a guano entry.

Kristen Ruby @sparklingruby : Twitter

1.グアノとは？

どのデータベースもその役割があります。行動やツイートの追跡などなど。それぞれ、いくつかのテーブルを持つデータベースを100かもしかしたら1000以上を所有している。だから、それぞれ（ほとんど？）のデータベース内にある「グアノ・テーブル」の中に生データが存在している。それらを我々はグアノと呼んでいる。その中にはフォーマットされていない「汚れた」データが含まれているからだ。あなた方はその点によく注意してもらいたい。もし、あるツイートに動きがあるとマークされたら、いくつかの方法でグアノに登録される。

※Guanoは鳥などの糞堆積物が化石化したもの。糞化石とも言う。主に肥料に使われる。恐らくTwitterの「小鳥の糞」という比喩。

2. Guano further explained

Tl;Dr sometimes we would take notes on a tweet and those notes would be collected in a guano table.

Kristen Ruby @sparklingruby : Twitter

2.グアノを更に説明すると

長すぎましたね。ツイートを記録し、その記録はグアノテーブルに集められる事もあります。

3. What is the testing tab?

So guano is as I defined. Let me know if that's not clear. Testing is when you have a machine learning algorithm and want to 'test' to see how good a job it did.

Kristen Ruby @sparklingruby : Twitter

3.検査タブとは何の事ですか？

私が定義したようにグアノはこうなります。不明瞭だと感じたら教えてください。テスティングは機械学習アルゴリズムがある時、それがどれだけよく機能してるかを知りたいときの検査方法です。

4. How does Twitter approach training data and AI? I predict Twitter Spaces data could be used to train a model.

Yeah. Definitely. Any thing on the site is fair game.
We used everythingto train.

Kristen Ruby @sparklingruby : Twitter

4.Twitter社はAIの学習データに関して、どのようにアプローチしているのですか？Twitterスペースのデータは学習モデルとして利用できるものだと私は推測しています。

まさにその通り。サイト上にある全てのものは公正な獲得物です。我々は全てのものを学習に使いました。

5. Can any employee read users Twitter DMs?

Yes. We could read them because they were a frequent source of abuse or threats. Again, strict protocols, but yeah.
So if I am being harassed
I might report that so
Twitter can take action.

Kristen Ruby @sparklingruby : Twitter

5.従業員はユーザーのダイレクトメッセージを閲覧することはできますか？

はい。我々はそれを閲覧することができました。何故なら、DMは乱用や脅威を察知する、一般的な情報源だったからです。
しかも、厳格な手順（プロトコル）だけど。
なので、もし私が侵害行為を受けていると感じたら、私にそういう報告されるだろうし、Twitterはアクションを起こすことができる。

6. What protocol was in place for access to confidential user data and direct messages?

We dealt with privacy by trying to restrict access to need to know.
Hmmm. Right. As far as bans. The trends blacklist is indicating it's on a list of trends blacklisted.
Meaning it violated one of our policies.

Kristen Ruby @sparklingruby : Twitter

6.ユーザーの機密情報やDMへ適切にアクセスできるようにどのようなプロトコルが設定されていたのでしょうか？

我々はプライバシーに配慮して、知る必要のある事柄に対してだけに制限的にアクセスするよう対処しました。
うーん、そうね、バンに限って言うなら、ブラックリストの傾向、リストに上がっているものを示唆しているか。
我々の指針に違反しているかという意味で。

7. Protocol:

Yeah. I don't know. That note up top is. We couldn't take action on accounts with over 10k followers. That had to be raised.

Kristen Ruby @sparklingruby : Twitter

7.プロトコル

そうね、私は知らない。一番上にある項目が一番興味深い。我々は1万人以上のフォロワーを抱えるアカウントに対してはアクションを起こせなかった。それらは引き上げる必要があった。

8. Did you have access to private user direct messages?

Ok. Yeah. We had access to tweets. That's how we trained our models. We would mark a tweet as misinfo (or not) or abuse or whatever like copyright infringement. People who had access were pretty strictly controlled and you had to get sign off to get access. BUT a fair amount of people working in Trust and Safety did have it.

Kristen Ruby @sparklingruby : Twitter

8.あたなはユーザーのプライベートなDMにアクセスできたのですか？

もちろん。我々はツイートをアクセスし続けてきた。それは我々の（機械学習）モデルをどのように学習させるか。あるツイートが誤報なのかそうでないのか、乱用や著作権侵害のようなものでないかをマークした。そういうアクセスがある人たちはまず同意書に署名をしてアクセスを認可されていました。その一方で「Trust and Safety」の多くの人がそういうアクセス権を持っていました。

9. Did Twitter engage in shadow banning?

Ok. So, and I don't mean to sound duplicitous, but let's first define shadow banning. Twitter considered shadow banning to be where the account is still active but the tweets are not able to be seen. The user is literally posting into thin air.

Kristen Ruby @sparklingruby : Twitter

9.Twitter社はシャドウバンに関与した？

OK、そう、二重に聞こえるつもりはありませんが、最初にシャドーバンを定義しましょう。 Twitterはアカウントがまだアクティブであるが、ツイートを見ることができない場合に、シャドーバンを検討しました。そういったユーザーの投稿は文字通り跡形もなく消える。

10. Shadow banning continued:

Here is what we did. We had terms (which, yes, were not publicly available) along the lines of hate speech, misinfo, etc. if your account repeatedly violated this but we decided not to ban you - maybe you were a big account, whatever we could 'deamplify' That's what I mean by corporate speech seeming shady. Lol. So, your tweet and account were still available to your followers. They could still see, but the tweets were hidden from search / discovery.

Kristen Ruby @sparklingruby : Twitter

10.シャドウバンに関して（続き）

こういう事をしていました。公にしていない条項が存在していました、例えばヘイトスピーチの系統に沿う、誤報などなど。仮に誰かがその条項の違反を何度も繰り返してもバンされることは通常ありませんでした。仮にそのアカウントが多くのフォロワーを抱える「大きなアカウント」をどのような「弱体化」したとしても。それが企業アカウントがいかに怪しいか、という事ですね。したがってあなたのツイートとアカウントはまだフォロワーにはアクセスでき、見る事はできるが検索したり偶然見つけたりする事は隠されているのでできない。

11. Shadow banning cont:

We tried to be public about it. Here's a blog post.
Setting the record straight on shadow banning
blog.twitter.com
Not a perfect system. You could argue, reasonably I think, that this is shadow banning. But that's why we said we didn't.

Kristen Ruby @sparklingruby : Twitter

11.シャドウバンについて（続き）

我々はその事について公開しようと試み、それをブログに投稿した（blog.twitter.com）
シャドウバンに繋がる場合の記録

完璧なシステムではない。シャドウバンがそれだと議論する事は可能だと思います。しかしだからこそそうではない、と言ったのです。

12. Were you aware that people were publicly banned from searches? How much of this was shared with employees vs. limited to certain employees?

Me? No. I didn't know. Those screen shots you've seen are from a tool called Profile Viewer 2 (PV2). If it's data that's captured. As this data was, SOMEONE could see it and was aware. But not me.
Those are tools for people we called 'agents'. People who manually banned tweets or accounts.
I was definitely an individual contributor and didn't have the full picture.

Kristen Ruby @sparklingruby : Twitter

12. 人々が公に検索からバンされていることを知っていましたか? そのうちのどの程度が従業員と共有されたのか、特定の従業員に限定されたのか?

私？いいえ、知りませんでした。ご覧になったスクリーンショットは、Profile Viewer 2 (PV2) というツールからのものです。キャプチャされたデータの場合。このデータがそうであるように、誰かがそれを見たら気が付くだろう、とはわかっていました。しかし、私では知りませんでした。
これらはエージェントと呼ばれる人達のためのツールです。 ツイートやアカウントを手動でバンした人達。
私は間違いなく、いち個人の投稿者であり、全体像を把握していませんでした。

13. Was shadow banning and search blacklisting info shared with other teams?

No.
Heavily siloed data. I literally just counted violations to see how good our algorithms were performing.
Hold on. That is part of content moderation. Technically called H
COMP
So they were aware of it. But not me
Few employees were aware Managers
in the thread of former ？？？ are like 'what?'

Kristen Ruby @sparklingruby : Twitter

13. シャドウバンと検索ブラックリストの情報は他のチームと共有されましたか?

いいえ。
大量のサイロ化されたデータ。文字通り違反とカウントされただけで、私たちのアルゴリズムがどれだけうまく機能しているかを確認しました.
ちょっと待って。これはコンテンツモデレーションの一部です。技術的にH COMPと呼ばれます。ほとんどの従業員は存在を知らないはず。以前の？？のスレッドのマネジャーが（以下不明）

14. Is there a disconnect on the definition of censorship?

I think there's a disconnect between what Twitter considers censorship and what others in public consider censorship.

Kristen Ruby @sparklingruby : Twitter

14.検閲の定義に（世間との）ズレはありますか?

Twitterが検閲と見なすものと、一般の人々が検閲と見なすものとの間にはずれがあると思います。

15. Censorship question cont:

Twitter feels it was very public about what it was doing. But these were often buried in blog posts.

Kristen Ruby @sparklingruby : Twitter

15.検閲の質問（続き）

Twitterは、それが何をしているかについて非常にオープンにしていたと感じています。しかし、これらはしばしばブログ記事の中に埋もれていました。

16. Database access process:

Well. I think it was a way to keepinfo segregated. To get access to a database you had to submit a request. Requests were for LDAP access. Just directory access. Once granted you had access to all tables in that db.
So maybe some of the eng databases or whatever had 1000s of tables. But I built some of these and for highly specialized purposes.
The schema would be like 'twttr-bq-politicalmisinfo'.So fairly specialized. Not for production so to speak.

Kristen Ruby @sparklingruby : Twitter

16.データベースのアクセスプロセス

そうね、情報を分離しておく方法だったと思います。データベースにアクセスするには、リクエストを送信する必要がありました。リクエストは LDAPアクセス※ に対するものでした。ディレクトリアクセスのみ。許可されると、そのデータベース内のすべてのテーブルにアクセスできました。
おそらくいくつかのエンジニアリングのデータベースか、何千ものテーブルを持っていたものなら何でも。しかし、私はこれらのいくつかを、高度に専門的な目的のために構築しました。
スキーマは「twttr-bq-politicalmisinfo」のようになります

そのため、かなり専門的です。いわば生産用ではありません。

※ LDAPとは、インターネットなどのTCP/IPネットワークでディレクトリサービスにアクセスするための通信プロトコルの一つ。

17. Database access continued:

Two layers of protection. The actions of the agents - human reviewers - were logged and periodically audited. Those are the people looking at those screenshots on PV2. That data went into a data table. You know, it looks like a spreadsheet. Access to that table was strictly limited in the following way. If you needed access for your job you submitted a request. It needed to be signed off by several layers. However, once you had access, you had it. And that access wasn't really monitored.

Kristen Ruby @sparklingruby : Twitter

17.データベースアクセスに関して（続き）

2レイヤー・プロテクション。エージェント (人間のレビュー担当者) のアクションは記録され、定期的に監査されました。 PV2でスクリーンショットを見ている人たちです。
そのデータはデータテーブルに入りました。ご存知のように、スプレッドシートのように見えます。そのテーブルへのアクセスは、次のように厳密に制限されていました。
仕事のためにアクセスが必要な場合は、リクエストを送信しました。複数のレイヤーによる承認が必要でした。ただし、アクセスできるようになると、それが得られます。そして、そのアクセスは実際には監視されていませんでした。

18. Please share more about how AI was used in the content moderation process:

Ok. So we had some words/phrases we were looking for. These were developed in consultation with lots of ppl. Govt agencies, human rights groups, academic researchers.
BUT govt itself can be a big source of misinfo. Right?
Yeah.
So our policy ppl, led by York Roth among others helped figure out terms.
But we weren't banning outright. Here's how it worked.

Kristen Ruby @sparklingruby : Twitter

18. コンテンツ・モデレート・プロセスで AI がどのように使用されたかについて詳しく教えてください。

OK、そのため、探していた単語/フレーズがいくつかありました。これらは、多くの人達と協議して開発されました。政府機関、人権団体、学術研究者。
しかし、政府自体が誤った情報の大きな情報源になる可能性がありますからね。
そのため、特にヨーク・ロスや他のメンバーが率いる私たちのポリシー担当者は、条件を理解するのに一躍買っていました。

しかし、私たちは完全にバンしていませんでした。そう機能していたのです。

19. Twitter AI content moderation continued:

Take for instance the movie 2000 mules. Our opinion of this movie was neutral. We didn't judge it.
But what we'd found was tweets which contained that term frequently were associated with misinfo.
If you tweeted about that, stolen elections, and a few other things, our Al would notice
We created a probability. We would give each tweet a ranking. A score. From 0 to 300. The scale is completely arbitrary but meant tweets with higher score were more likely to have misinfo.

Kristen Ruby @sparklingruby : Twitter

19.TwitterのAIコンテンツ・モデレーション（続き）

たとえば、映画 2000ミュール※ を考えてみましょう。この映画に対する私たちの意見は中立でした。私たちはそれを判断しませんでした。
しかし、私たちが見つけたのは、その用語を頻繁に含むツイートが誤った情報に関連付けられていることでした。
あなたがそれについてツイートした場合、「盗まれた選挙」、他のいくつかのことなども。私たちのAIは気付くでしょう
確率を作成しました。各ツイートにランキングを付けます。スコア0 から 300 まで。スケールは完全に任意ですが、スコアが高いツイートほど誤った情報が含まれる可能性が高いことを意味します。

※ アメリカの陰謀論政治映画

20. Twitter AI Content Moderation continued:

If your tweet scores high enough (above 150) our algorithm could ban, annotate, or suspend the acct. but we tried to be selective. You'll see in the tweets I send
Not all that scored very highly ended up being banned
I hope that makes sense.

Kristen Ruby @sparklingruby : Twitter

20.TwitterのAIコンテンツ・モデレーション（続き）

ツイートのスコアが十分に高い (150 を超える) 場合、アルゴリズムはアカウントをバン、注釈付け、または停止する可能性があります。 しかし、私たちは選択的なものにしようとしました。あなた達がツイート中で分かるように。
非常に高得点のすべてが最終的にバンされたわけではありません。
それが理にかなっていることを願うばかりです。

21. How is Twitter using Machine Learning/ AI in the content moderation process?

Machine learning models 'learn' by being trained on data. Here are 1 million tweets that are not violations. Here are some that are.
Then you have it guess. Is this tweet a violation? If it guesses it is a vio and it is that's a true positive. If it guesses it is a violation and it isn't that's a false positive. If it guesses it's not a violation and it is, that's a true negative. If it guesses it's not a violation and it is, that's a false negative. The idea is to get the true positive and negative up and the false positive and negative down. New data helps with that.

Kristen Ruby @sparklingruby : Twitter

21. Twitterはコンテンツ・モデレート・プロセスで機械学習/AI をどのように使用していますか?

機械学習モデルは、データでトレーニングされることによって「学習」します。違反ではない 100 万件のツイートを以下に示します。ここにいくつかあります。
では、推測してみましょう。
このツイートは違反ですか？

それが違反であると推測し、それが真陽性である場合。
それが違反であると推測し、そうでない場合は偽陽性。
違反ではないと推測し、実際に違反している場合は真陰性。
違反ではないと推測し、実際に違反している場合は偽陰性です。

アイデアは、真陽性と真陰性を上げ、偽陽性と偽陰性を下げることです。新しいデータはそれに役に立ちます。

※ 機械学習用語
True Positive (TP) : 真陽性
Falese Positive(FP) : 偽陽性
False Negative(FN) : 偽陰性
True Negative(TN) : 真陰性

22. Twitter AI + Machine Learning continued:

ML is just statistics. It makes best guesses for unknown data (abuse, political misinfo) based on what it's seen before.

Kristen Ruby @sparklingruby : Twitter

22.Twitter AI + 機械学習 (続き)

機械学習は単なる統計です。既知のものに基づいて、未知のデータ (悪用、政治的な誤った情報) について最善の推測を行います。

【まとめ】
いかがでしたでしょうか？
皆さん、それぞれの感想を持たれた事だとおもいます。ただTwitter社内部のシャドウバン等に関して驚くべき情報が開示された意義は大きいでしょう。

あくまで、私の私見ではありますが。イーロン・マスクは単にITプラットフォームを買収したのではない。想像を超えた言論プラットフォームと化したSNS。しかも歪んだリコメンドエンジンにより世論さえ左右した由々しき事態を見過ごせなかったのだろう。

これによりITという道具の使い道を誤ると何が起こるのかを示した功績は多大であるのではないでしょうか？

2022年12月14日
奥津哲雄

この記事が気に入ったらサポートをしてみませんか？