Lolcow KingCobraJFS / Josh Saunders - Amateur musician, YouTube Streamer, wandmaker, and self-proclaimed "sexy goth badboy". Perpetually circling the drain.

  • 🐕 I am attempting to get the site runnning as fast as possible. If you are experiencing slow page load times, please report it.
any here into stylometrics? not a data scientist so i could be reading all this wrong
What a coincidence! I've done some semi-professional work with stylometry and yeah, you're reading this completely wrong.
Word length doesn't mean much, aside from a general indication that two people write at about the same level. A child will use shorter words, an academic paper will use longer words. There's not a whole lot of meaningful information to be gleaned from word length, at least on its own. Sentence length is similarly meaningless. Paragraph length is a bit more useful but not enough to make a real conclusion from.
Letter frequency is totally meaningless, like absolutely a worthless metric. You'll find that basically every chunk of English text, if it's long enough, will share a pretty similar distribution.
Punctuation is, contrary to what may seem obvious, probably among the most useful metrics here, but it's still not a very useful metric on its own. POS distribution is also a useful metric, but again, not on its own.
Unique word choice is a very useful metric in some scenarios - if I remember right, there was some pedophile who ultimately got caught because the initial lead on him was that he greeted people by saying "heya" instead of "hey" or "hi" and "heya" tends to only be used in a fairly small part of the world. However, in this case, it's almost certainly a red herring because we're in a fairly insular community which uses slang that doesn't appear elsewhere. Compared to any member of the public, people who say words like "boglim" or stuff like that will stand out when you analyze text in this way. It points to a similarity with each other versus the general public, but it doesn't point to a similarity between the two writers within the context of this insular community.
ChatGPT's analysis is totally meaningless. I've experimented with using LLMs for stylometry and it turns out they're awful at it because, often enough that it makes their output unreliable, they'll just tell you what they think you want to hear. They're biased towards saying two writers are the same because you've started off by asking the question "are these two writers the same" essentially. You can often get LLMs to completely 180 on an arbitrary analysis like this just by asking "hmm, are you sure about that?" because "hmm, are you sure about that?" tends (tended?) to produce a response of "Oh, you're right, I'm mistaken" in the training data, and then an explanation made up after the fact about why the first explanation was wrong and this new explanation is right.
Using PCA on this data makes no sense. Maybe I'm missing context from what the lead-up to ChatGPT's response was, but PCA isn't an appropriate tool for comparing one piece of data against a second piece of data. It's useful if you're comparing a couple data points against hundreds or thousands of other data points, but comparing 2 things to one another using PCA is kinda baffling, which is why I wonder if I'm missing something from this.
Above, when I said that these metrics are useful, but not on their own, what I mean is that they're useful in aggregate when compared to tons and tons of other data points. Statistically significant differences are visible when you compare arrays of word length, letter frequencies, punctuation frequencies, unique word occurrences, etc, and treat them all as input into more complex tools, but when you're just looking at graphs of these pieces of information in isolation, even things which seem statistically significant (like letter frequency) aren't really. Even then though, these very basic stylometric features aren't very information-rich.
Token sequence analysis or n-gram analysis is generally considered to be a better avenue for authorship attribution, at least to my knowledge. I recommend these papers if you're interested in learning a bit more:
Of particular note:
1711511525696.png

Using purely stylometric features produces really poor results. Barely above coin flip odds, even under good conditions.
 
Goddamnit I figured it was only a matter of time until full blown schitzos found this thread and began smearing shit on the walls but why’d it have to come at a time like this?

Can you guys just take your meds and fuck off back to the Reddit or r/the_boglim or your stupid discord and let us discuss cobras birthday shenanigans in peace without all this retarded off topic data entry shittery and witch-hunting?
 
There's more than 300 people still waiting for her, 4 hours after she was supposed to start the stream. So many cows we document would kill for that many viewers and everyone's just looking at Bjork and bitching about the stream not starting.

What do we think? Fighting or fucking? Or dead from Habanero shits?
 
There's more than 300 people still waiting for her, 4 hours after she was supposed to start the stream. So many cows we document would kill for that many viewers and everyone's just looking at Bjork and bitching about the stream not starting.

What do we think? Fighting or fucking? Or dead from Habanero shits?
whoops, thanks for reminding me to close that tab!
 
What do we think? Fighting or fucking? Or dead from Habanero shits?
Honestly, they’re probably just too drunk to function right now and the hag just completely forgot about the scheduled stream altogether. They’re also most likely fighting and/or fucking too on top of being boozed up.
 
Last edited:
On this day (well, yesterday) our Bog Boy was born 33 years ago. He did not stream, which was bad for us, but maybe it was good for him.

He shares his birthday with Nancy Pelosi (possible lizard), Richard Dawkins (king autist), Viktor Frankl (no meaning to be found here), and Steven Tyler (probably diddled groupies).

What a poop of a day. It's not 4am in Casper, yet, there's still hope.

New Bog Chron, at least we get something.
 
I strongly believe Cobes is in big doodoo with his apartment building and may even be getting evicted. Something is definitely up. Let me break down my deductive retardation:

- The boy seemed to be projecting a lot and coping in the mead video: like I mentioned earlier he said something like “I’d never call the cops or complain about any of my neighbors no matter how loud they are”

This comes out of nowhere and there seems to be no reason to bring it up in a mead review. It’s because he’s just thinking out loud. He’s clearly upset and it seems like he feels hurt/betrayed. It’s like he’s saying “I would never complain about my neighbors so it’s unfair that they complained about me”

- he points out he’s not having a good day and that he’ll “work through this”
but what is he working through? It’s clear in the video he and jessica had made up from the night before.

- he continually gets distracted and loses himself in thought. Bit like he usually does where he blankly stares at the screen and clinks his ring before coming back. No he’s staring off and clearly really bothered by something, he looks like he might cry.

What was it that got him upset? It’s clearly not Jessica since she was mad and freaking out at him last night and he ignored her and just said “I’d wish you’d shut up” and called her a bitch. That was when they were in bad terms and today they’re on good terms so he’s clearly not bothered by that.

- this one was overlooked and I can’t believe people haven’t been theorizing about this all day. Josh says and this is an exact quote “I appreciate everybody being patient with me as long as they can… or could. I just wanna say I’m sorry.”

That really settles it. He said he appreciates people being patient with him as long as they can but quickly corrected himself and said as long as they could. Past tense. that reveals two things A) the people who were patient with him no longer are and B) they are unequivocally done with him and he doesn’t see a way to fix the situation because he corrected himself as if he misspoke when hey implied they still were. Something happened and it brought a sense of finality. If he had just gotten a complaint and a stern talking to he wouldn’t have felt the need to correct himself about people being patient with him, he would thank them for continuing to be patient with him.

Not just that but he apologized and it wasn’t a backhanded “I’m going through the motions but I’m not sorry” apology it seems like a pretty sincere apology. How often does Cobes sincerely apologize and (somewhat) take responsibility rather than coping and making excuses?

He also didn’t go live on his birthday which is his prime time paypig day. Could it be he was having so much fun with his gf? His dad? Or could it be he’s hurting and in a tough spot and has to maybe find a place to live for April?

Maybe it’s nothing. Maybe I’ll just look like a retard but I I think my detective skills are tight and if my hunch is correct it won’t be very long until we find out that “well, a lot’s up, Josh.” For the second time
 
Back