Tokenizer

That, in the forums! It’s a bird! It’s a plane! It’s THE LINKTEXT!!!
I could make this more efficient but I really just worked as I went on

It works. But has a bit of an error.

What error?

The tokens seem to separate randomly and un organized.

oh its because it finds the most common pair of characters/tokens and makes it a token

I think another error would be for it to keep merging the beginning of the text and the next token after a few iterations

Wow, are you going to make an AI only using :snap:?

That’s already been done! Look at this project by @jens:

Peak story:

Yeah, It’s not very good, but still very interesting, as it is based off 30 storybooks! Check the code!

That is a Markov Chain, not what you usually think when people mention AI.

I know, but it is still undeniably cool.

i unfortunately do not know