The Twitter algorithm is now officially public.

I don't think we can know too much about the "trust and safety" shit without knowing what data it was trained on, unfortunately.
 
Besides the established userbase and the pre-existing relationship with advertisers, isn't the algorithm the entire business model? I'm not that tech literate but this seems like a bad business move, no?
I mean, you're putting aside their entire brand. No one can make a copy of Twitter and reach anything close to their success. Releasing this could be great PR on the transparency front.
 
Besides the established userbase and the pre-existing relationship with advertisers, isn't the algorithm the entire business model? I'm not that tech literate but this seems like a bad business move, no?
Writing a simple Twitter clone takes a single intermediate programmer and a weekend at most. The value is not in the code.
 
If you look through the Trust and Safety training models and called models, they only appear to have been trained (and run) on English tweets and emojis.
That's not even remotely surprising. If the kind of people who fritter about trying to police harmless speech and constantly scream about "toxicity" ever knew what's said about them in the safety of other languages, they'd cry themselves to death.
 
That's not even remotely surprising. If the kind of people who fritter about trying to police harmless speech and constantly scream about "toxicity" ever knew what's said about them in the safety of other languages, they'd cry themselves to death.
Right, it's just interesting to see it spelled out in the code. It also indicates that they did the training themselves on internal Twitter data, since multilingual toxicity models for BERT already exist and could have been used off the shelf.
 