https://abs.twimg.com/emoji/v2/... draggable="false" alt="🚨" title="Polizeiautos mit drehendem Licht" aria-label="Emoji: Polizeiautos mit drehendem Licht"> Huge new FB/Insta research initiativehttps://abs.twimg.com/emoji/v2/... draggable="false" alt="🚨" title="Polizeiautos mit drehendem Licht" aria-label="Emoji: Polizeiautos mit drehendem Licht"> to finally conduct an independent, pre-registered research on the 2020 election. https://medium.com/@2020_election_research_project/a-proposal-for-understanding-social-medias-impact-on-elections-4ca5b7aae10">https://medium.com/@2020_ele...
This initiative, led by @TaliaStroud @j_a_tucker + @AnnieFranco @chadkdj on the FB side will study social media& #39;s impact on democracy with unprecedented data access.
Many researchers inside FB, myself included, have fought for a long time to see something like this realized.
Get this: it& #39;s all pre-registered. That& #39;s key for the cred of effort, not just to show & #39;I didn& #39;t p-hack& #39;, but for the ORIGINAL motivation--as a credible, costly signal that inconvenient results have not simply been put into the & #39;file drawer& #39; never to see the light of day.
Expect a lot of mixed survey-behavioral data research designs here, because running surveys allows you to get informed consent, which opens up a vast array of potential research designs not possible when just analyzing log data.
There& #39;s also a commitment to establish a process to allow for replication, key for publishing in top journals. This is AFAIK unprecedented and would be a huge innovation with implications for all industry-academic partnerships utilizing big, sensitive data.
And they& #39;ve committed to publish every project period, even if not accepted in a journal.
One of the smartest innovations here, is to have FB staff analyze the data. While the URLs data I helped release has purpose & utility, the *scope of data* potentially on offer here is far broader in scope. I HAVE SO MUCH TO SAY ON THIS POINT KEEP READING
The most important data at Facebook is graph data: the connections

1. between people
2. between people & interests, preferences, group identities
3. between people & ideas

DUH. THAT& #39;S HOW FB KEEPS PEOPLE ON THE SITE & SELLS ADS SO WELL
We all saw how hard it was since 2018 to share ordinary FB data w researchers in ways that complies w/ law on privacy. Just making aggregated data on exposure to URLs under DP was a herculean task, just scroll through the detail in the documentation: https://solomonmg.github.io/pdf/Facebook_DP_URLs_Dataset.pdf">https://solomonmg.github.io/pdf/Faceb...
But it& #39;s (near?) impossible to protect graph data under differential privacy & still allow researchers to answer their questions. This innovation cuts the Gordian knot & allows potentially any data at FB to be analyzed w/ help from top research scientists like @AnnieFranco, who..
like many at FB (& esp Core Data Science) who could be both a top DS an any co & has top-tier research chops (AND btw has a publication in Science on the file drawer problem)
Can we trust FB employees to keep everything legit? Well I trust many of them personally. But you don& #39;t have to, because they& #39;ve built in a monitoring system to ensure scientific integrity, presumably operating under some kind of audit framework.
You can follow @SolomonMg.
Tip: mention @twtextapp on a Twitter thread with the keyword “unroll” to get a link to it.

Latest Threads Unrolled: