Looking for software please help

internaut

New member
comfy friend
Joined
Jun 29, 2022
I have a database of over a million messages from a now-deleted Discord server. The database contains every message (including system messages), all user IDs (authors and mentioned users), user information (profile picture URL, name, discriminator aka tag, nickname, whether the account is a bot), URLs of attachments, stickers, and lots of other miscellaneous things. The archive is in JSON format.

I want to make a word cloud of words over time, or some kind of interesting statistics with the messages. What software could I use?
Edit: I ended up making a chatbot based on a Markov chain out of it.
 
Python's probably the best bet, but I bet you could do something in Processing. As someone who can't program to save my life I know exactly how unhelpful this is, though.
 
Yeah, what you want to do is pretty specific and I'm afraid you'll have to write your own program, but since working with JSON is pretty easy, I don't think that's gonna be much of an issue.
Alternatively, you could provide a sample of the JSON and tell us exactly what you want, and maybe someone will be willing to work on this.
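
For what it's worth, here is a rough Python sketch of the "words over time" idea. The archive's layout was never posted, so the field names "timestamp" (an ISO 8601 string) and "content" are assumptions, as is the idea that the file holds a plain list of messages. Treat it as a starting point and adjust it to the real JSON.

```python
import json
import re
from collections import Counter, defaultdict

# Very rough tokenizer: lowercase runs of letters and apostrophes.
WORD_RE = re.compile(r"[a-z']+")


def load_messages(path):
    """Load the archive, assuming the file is a JSON list of message objects."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def words_by_month(messages):
    """Return {"YYYY-MM": Counter of words} for a list of message dicts."""
    buckets = defaultdict(Counter)
    for msg in messages:
        # Assumed fields: "timestamp" (ISO 8601 string) and "content" (text).
        month = msg["timestamp"][:7]  # "2022-06-29T..." -> "2022-06"
        buckets[month].update(WORD_RE.findall(msg["content"].lower()))
    return dict(buckets)
```

You would call words_by_month(load_messages("archive.json")) (the file name is a placeholder) and then use counts.most_common(50) for each month as input to whatever word cloud tool you like.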
 
would you mind sharing what you used for that? i am interested
This code was made for an IRC bot but it's easy to modify it for anything you want.
To save you some time understanding how it works:
To run it, you must pass the bot's username as a parameter: "cgbot.exe comfybot". Once running, it takes input from stdin (your keyboard on the command line). The first word of each input line is the username of whoever is sending the message; the username must always be a single word. The rest of the line is the message, which ends with "\n" (line break/Enter). To make it learn and enable it to reply, someone with the username "###" must type "ENABLE" (you would type "### ENABLE" on the command line). That triggers a rehash (relearn) and enables the bot, which replies only when its name is mentioned. Replies are written to stdout (your command line). The Markov parameters are at line 23 and the commands are at line 444.
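
The input format described above can be illustrated with a short Python sketch. This is not the bot's actual code; the names parse_line and BotState are made up for illustration, the case-insensitive name match is an assumption, and the rehash is reduced to a counter.

```python
def parse_line(line):
    """Split one input line into (username, message).

    The first word is the username; everything after the first space
    is the message, with the trailing newline removed.
    """
    username, _, message = line.rstrip("\n").partition(" ")
    return username, message


class BotState:
    """Tracks whether the bot may reply (hypothetical helper)."""

    def __init__(self, bot_name, admin_name="###"):
        self.bot_name = bot_name
        self.admin_name = admin_name
        self.enabled = False
        self.rehash_count = 0

    def handle(self, line):
        """Return True if the bot should reply to this line."""
        username, message = parse_line(line)
        if username == self.admin_name and message == "ENABLE":
            self.enabled = True
            self.rehash_count += 1  # stand-in for the real relearn step
            return False
        # Assumption: the name check is a case-insensitive substring match.
        return self.enabled and self.bot_name.lower() in message.lower()
```

Feeding it "### ENABLE" followed by "alice hi comfybot" would mark the second line as one to answer, while lines that do not mention the bot are ignored.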

I parsed all messages from the JSON file into a txt file following the "user message" format and fed it in via fstream. At the end of the txt file I added an end indicator. I modified the code to read everything in the file; when it reaches the end indicator, it triggers a rehash, enables the bot, sends a "Ready" message to stdout, and switches to taking input from stdin. I made it work with Discord by making a bot in Node.js which interacts with stdin/stdout.
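
For anyone wanting to try the same pipeline without the original program, here is a minimal Python sketch of the two steps: flattening messages into "user message" lines with an end indicator, and building a simple word-level Markov chain from them. The field names "author" and "content", the END_MARKER value, and the chain order are all assumptions; the real bot's parameters and end indicator are not given in this thread.

```python
import random
from collections import defaultdict

# Hypothetical marker; the post does not say what the real indicator was.
END_MARKER = "###END###"


def to_training_lines(messages):
    """Turn message dicts into "user message" lines plus an end marker."""
    lines = []
    for msg in messages:
        # Usernames must be one word, so spaces become underscores.
        user = msg["author"].replace(" ", "_")
        # A message ends at the newline, so collapse all whitespace.
        text = " ".join(msg["content"].split())
        if text:
            lines.append(f"{user} {text}")
    lines.append(END_MARKER)
    return lines


def build_chain(lines, order=2):
    """Map each tuple of `order` words to the words seen right after it."""
    chain = defaultdict(list)
    for line in lines:
        if line == END_MARKER:
            break
        words = line.split()[1:]  # drop the username
        for i in range(len(words) - order):
            chain[tuple(words[i:i + order])].append(words[i + order])
    return chain


def generate(chain, max_words=20, rng=random):
    """Random-walk the chain from a random starting state."""
    if not chain:
        return ""
    state = rng.choice(list(chain))
    out = list(state)
    while len(out) < max_words and state in chain:
        out.append(rng.choice(chain[state]))
        state = tuple(out[-len(state):])
    return " ".join(out)
```

A higher order gives more coherent but less varied output; with a million messages, order 2 or 3 is a reasonable place to start experimenting.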
 