After having used the new model for over a month, mostly on AI Story Generator, and investigating on the old and new AI models used, I've reached to a conclussion that, in m opinion, makes sense.
The old model was Llama 2. Llama 2 (and Llama 3) are models feed on books, as in lots of literature. Meta licensed a LOT of them to train the models.
The new model is Deep Seek, or at least it seems to be so. We'll assume it is, but to be fair, it doesn't changes the argument a lot. DS has an issue, it is trained on normal content, say: internet, some books obviously, interations, etc.
Now, what's the issue with this?
Llama is a model that knows WAY better how a story works, having hundreds of them on its dataset and having processed them during its training. DS doesn't, DS is a more generalist model, thought more as an assistant than a story creator.
For the kind of usage done here, essentially either chatting with characters with AI-Character-Chat or writing a story with AI-Story Generator, the improvement in context and general knowledge DS gives is not worth the decrease in narrative quality, and understanding of story writing. That's not mentioning all the hallucinations, total ignoration of context and prompting, and similar the new model has.
Llama 2 is a way better option for the kind of usage we have. Yes, we would be lossing some general knowledge. Yes, it may not be the best AI model out there. But it's all things considered, it's a matter of chosing the best option for our use case.
I understand the dev does all this work alone, and appretiate his effort for it. That's why, as a really active user of this platform and service, I consider the best choice here is to return to the old model.
If you have some argument more for it, please add it in the comments. Thanks everyone for your time.
-Lucalis.
I agree. The new model isn't very good. The main issues I have with it are that it tends to break immersion, saying stuff like "BREAKING_BAD_PATTERNS" or talking about "breaking bad patterns" for no given reason whatsoever and has no context to the roleplay at hand. They also tend to speak like this all the time:
John nods, "I think we've got it under control," he said "but watch your back just in case."
Even if the initial message has asterisk roleplay such as this, characters added to that thread will still talk like the above example, asterisks have to be incorporated a LOT into the initial message (at least from what I observed) in order for the new characters to RP like that.
The new model isn't that bad though, as it is able to portray fictional characters almost perfectly. I don't really use the AI Story Generator much, I only use the AI Character Chat and occasionally the image generator. I'm saying this based on what I've observed from there.
I don't have as much experience with character chats, but the old model was perfectly capable of handling fictional characters. I mostly use it for story creation, and the decrease in narration quality is stagering