I'll get to the interesting data aggregation side of things, but first, this titbit from earlier today:
You are - under no circumstances - to handle any weapon longer than your left arm, any activated firearm, bow, or slingshot device, or purchase any knives of any kind, whatsoever.
^ That is my new bday present to myself, a small knife used for gutting kills. So much more wieldy than that other large knife which I had been using for, well, EVERYTHING. In case you've never been to an arms fair, btw, here's what they look like:
Katanas. I only noticed how many of them there were when I wasn't allowed to hold them.
tagged because he likes this kinda thing. I got a business card for a legit "Eastern, Islamic and Tribal" weapons craftsman, which will definitely be worth a visit next time.
ANYWAYS, on to the data
As some of you may know, I happened across a set of DA comments. The set was SOMEWHAT LARGE but very interesting. (All the data was in the public domain so don't start b**tching about privacy and whatnot because you knew this was going to happen 'cause you were the one that posted it). My original plan was to make a system that would be able to detect users who were feeling down, depressed or possibly contemplating suicide, and then report this to the user's friends to see if they'd be able to cheer them up. BUT, as I was developing it, the Samaritans suicide support group got bodyslammed by their user base, saying their privacy was infringed, so I decided not to argue, and dropped that project.
Which left me with an excessive amount of comment data, a lot of which was extremely weird.
Weird though it was, I've decided to put that data to good use to make some interesting word clouds. The database selected 30,000 random comments (any more took too long to filter), then filtered them based upon certain search terms. I did a few filters based upon content, who it was to, who it was from, and where it appeared, the results of which are available below. If anyone wants a wordcloud generating for them, let me know. It's just a query away.
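A rough sketch of what that sample-then-filter-then-count pipeline could look like. This is an illustration under assumptions, not the actual code behind the clouds: it uses Python with an in-memory SQLite database, a hypothetical table called `comments`, a made-up stop-word list, and three invented rows. Only the column names (`Text`, `From`, `To`, `Owner`) and the LIKE filter come from the queries shown further down.

```python
import re
import sqlite3
from collections import Counter

# Hypothetical stop-word list; the post doesn't say which words (if any) were dropped.
STOP_WORDS = {"the", "a", "is", "and", "to", "of"}


def word_counts(conn, pattern, sample_size=30000):
    """Sample random comments, keep those matching a LIKE pattern, count words."""
    rows = conn.execute(
        "SELECT `Text` FROM "
        "(SELECT `Text` FROM comments ORDER BY RANDOM() LIMIT ?) "
        "WHERE `Text` LIKE ?",
        (sample_size, pattern),
    ).fetchall()
    counts = Counter()
    for (text,) in rows:
        # Lowercase and split into simple word tokens before counting.
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if w not in STOP_WORDS)
    return counts


# Tiny invented dataset standing in for the real comment dump.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE comments (`Text` TEXT, `From` TEXT, `To` TEXT, `Owner` TEXT)"
)
conn.executemany(
    "INSERT INTO comments VALUES (?, ?, ?, ?)",
    [
        ("Lucario is really cute", "a", "b", "c"),
        ("I love Pikachu", "b", "a", "c"),
        ("lucario hugs", "c", "a", "b"),
    ],
)

counts = word_counts(conn, "%Lucario%")
print(counts.most_common(5))
```

The word counts are what a word cloud renderer would then scale into font sizes; swapping the `WHERE` clause for one on `From`, `To` or `Owner` gives the other kinds of cloud.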
The wordcloud with just the 30,000 comments processed:
Frankly I'm ashamed that 'like' was the number one term. Also surprising (or not) that Mewitty is on there. There's actually a lot to be learnt here. Firstly, that DA seems to like a lot of made up words. Secondly, you're a very mushy group: 'Love', 'Hugs' and 'Cute' all showed higher up than normal.
Comments containing the word 'Lucario'
Executed SQL: WHERE `Text` LIKE '%Lucario%'
I wanted to test that the LIKE clause worked correctly and didn't take too long
Well I think it works. One thing to point out, MOST (approximately 80%) of the comments on DA are RP. This probably explains why odd names feature quite prominently in the above. Obvs 'chest' makes an appearance here because, well, Lucario has a spike in his. But (and I'm sorry about this) people, STEP UP YOUR ADJECTIVE GAME. 'REALLY' IS A S**T ADJECTIVE. If you MUST RP, then please, at least be creative.
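For anyone wanting to try the same kind of LIKE sanity check themselves, here is a minimal sketch. It assumes Python with an in-memory SQLite database and three invented comments, so the timing means nothing for a real dataset. Two things worth knowing: SQLite's LIKE ignores case for ASCII letters by default, whereas in MySQL that depends on the column's collation, and a pattern with a leading `%` generally can't use an ordinary index, which is one reason filtering gets slow as the sample grows.

```python
import sqlite3
import time

# Invented rows; the table name `comments` is an assumption.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE comments (`Text` TEXT)")
conn.executemany(
    "INSERT INTO comments VALUES (?)",
    [
        ("Lucario has a chest spike",),
        ("go LUCARIO!",),
        ("Pikachu is yellow",),
    ],
)

# Time the substring match and collect what it returns.
start = time.perf_counter()
matches = [
    row[0]
    for row in conn.execute(
        "SELECT `Text` FROM comments WHERE `Text` LIKE ? ORDER BY `Text`",
        ("%Lucario%",),
    )
]
elapsed = time.perf_counter() - start

print(f"{len(matches)} matches in {elapsed:.6f}s")
for text in matches:
    print(text)
```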
Actually, that makes me think. What other pokémon should we put through this?
Comments containing the word 'Pikachu'
Executed SQL: WHERE `Text` LIKE '%Pikachu%'
I'm expecting 'yellow' and 'Japan' to be the top two here.
'Crying'. REALLY? Actually, all of this is appalling. Plz, WTF, ya...
If anyone can explain why 'tube' and 'sphere' feature here, please, let me know.
Comments containing the word 'Love'
Executed SQL: WHERE `Text` LIKE '%Love%'
Let's see. Yup, the clichés 'cute', 'good', 'dutiful' and 'great' have all made it in there. Clichés exist for a reason, I guess. Whoever it was who used 'tits' enough to get that in there, kudos to you.
I wonder what happens when we only do comments I've written.
Comments authored by the user 'TheModerator'
Executed SQL: WHERE `From`='TheModerator'
Not going to lie, the word I use most is probably going to be 'Me', followed by 'I' and 'Personally'. Let's see.
Hahaha. 'Science' and 'Chemicals' are in the top, but the major one is, unsurprisingly, 'People'. What can I say? Other people have a LOT of problems. (Yes, that was ironic).
What about comments that are going TO somebody?
Comments sent to 'StreetDragon95'
Executed SQL: WHERE `To`='StreetDragon95'
We all know StreetDragon95's work. I'd bet £10 that 'Cute' is in the top three largest words on the page, along with 'Amazing'.
Well, 'adorable' is a better version of 'Cute'. So I was close to 100% right. That aside, I think this cloud represents him pretty well. No comment on why 'porn' or 'Autism' is on there, though.
One last one, what about comments made on somebody's artwork?
Comments on work owned by 'snivylover4125'
Executed SQL: WHERE `Owner`='snivylover4125'
I'll be honest, SnivyLover4125 was someone I had manually profiled during the Freeman Fiasco. Let's see how his cloud pans out.
Huh. Pretty much matches what I'd expect. Didn't expect 'tighthug' or 'hehehe' though, so you got those two over me at least. He also appears to get people thanking him a lot. Sounds like a nice enough dude.
So, yeah, this is what is ACTUALLY happening to all your data. I am using it to make pretty pictures with words. All of a sudden it's not too worrying, now, is it?
That said, if you want your wordcloud making for you or anything, just send me a note and I'll do it, ASAP and free of charge.
'Til next time