• 9 Posts
  • 31 Comments
Joined 9 months ago
Cake day: October 23rd, 2023

  • Prompt:

    I’m currently trying to show on the website Beehaw that certain LLMs are far superior in writing to others. Examples of what bigger models do better than smaller ones: *

    Mistral-7B-Instruct-v0.1:

    • ntire articles* vs Headlines Descriptions vs Product titles *Bul

    GPT-3.5 Turbo doesn’t support plain completion, as it’s chat-only, so I used an even worse model, text-davinci-003, which is far behind the state of the art:

    • Bigger models are able to handle more complex and detailed tasks with ease

    • Bigger models are better suited for natural language understanding and text processing
    • Bigger models are able to learn more accurate representations of context, thus improving the precision of the output
    • Bigger models can process data more quickly and efficiently, saving time and processing power when large volumes of data are used
    • Bigger models can better recognize more subtle nuances in language, which allows them to produce more accurate results
    • Bigger models are able to use more sophisticated algorithms, resulting in a more comprehensive and deeper understanding of the data being used

    Mistral 7B might be okay for some very specific cases, but it’s not comparable to proper models at all.

    edit: gave it a second chance; it’s a bit better (at least no complete nonsense anymore), but the writing is still terrible and doesn’t make much sense:

    Paraphrasing: The ability of a language model to generate text that has a similar meaning to the original text is called paraphrasing. This is a very common problem in natural language processing, and many LLMs are designed to be able to paraphrase text. However, there are some LLMs that are particularly good at paraphrasing, and these models are often preferred over smaller models because of their ability to generate more varied and unique text. Examples of LLMs that are known for their paraphrasing abilities include GPT-2 and transformers. These models
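
    For anyone who wants to reproduce the comparison: a minimal sketch, assuming the legacy openai Python SDK (0.x) from when text-davinci-003 was still being served, with the API key read from the environment; the prompt string is the one quoted above, and max_tokens is an arbitrary choice.

    ```python
    import openai  # legacy 0.x SDK; reads OPENAI_API_KEY from the environment

    PROMPT = (
        "I'm currently trying to show on the website Beehaw that certain LLMs "
        "are far superior in writing to others. Examples of what bigger models "
        "do better than smaller ones: *"
    )

    # text-davinci-003 is a plain completion model: it just continues the text.
    completion = openai.Completion.create(
        model="text-davinci-003",
        prompt=PROMPT,
        max_tokens=200,
    )
    print(completion.choices[0].text)

    # gpt-3.5-turbo is chat-only: the same prompt has to be wrapped in a message.
    chat = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": PROMPT}],
        max_tokens=200,
    )
    print(chat.choices[0].message.content)
    ```

    The Mistral run would go through whatever local or hosted completion endpoint serves that model, so it’s left out here.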

  • They believed that the AI safety work they had done was insufficient.

    Considering that every new model seems to be getting worse for anything but highly sanitized corporate usage, I’m not sure that I want more AI safety …

    For my usage, I use GPT-3.5 Turbo with the March checkpoint (gpt-3.5-turbo-0301) because I can’t get the current one to stop moralizing about bullshit instead of doing what it’s supposed to (I run two Twitch bots with it). GPT-4 used to be fine there, but the new preview is starting to show the same issue, with more frequent “I can’t do that, Dave”-style answers; it’s still mostly circumventable with enough prompt massaging, but it’s getting harder.
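
    Pinning the March checkpoint just means passing the dated model name instead of the floating alias; a minimal sketch with the legacy openai SDK, with all the Twitch plumbing left out and a made-up system prompt:

    ```python
    import openai  # legacy 0.x SDK; reads OPENAI_API_KEY from the environment

    def bot_reply(user_message: str) -> str:
        # "gpt-3.5-turbo" floats to the newest checkpoint; the dated name
        # "gpt-3.5-turbo-0301" stays pinned to the March snapshot.
        response = openai.ChatCompletion.create(
            model="gpt-3.5-turbo-0301",
            messages=[
                # hypothetical system prompt, purely for illustration
                {"role": "system", "content": "You are a chat bot for a Twitch channel."},
                {"role": "user", "content": user_message},
            ],
            max_tokens=150,
        )
        return response.choices[0].message.content
    ```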

    If trajectories hold, in a year I don’t see anything but self-hosted models being usable for anything that isn’t corporate glitz, so fuck all that AI safety.
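
    The self-hosted route is already just a few lines with Hugging Face transformers; a minimal sketch using the Mistral-7B-Instruct-v0.1 checkpoint mentioned above, assuming transformers, torch, and accelerate are installed and a GPU with enough VRAM is available (quantization and serving details omitted, and the user message is a made-up example):

    ```python
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "mistralai/Mistral-7B-Instruct-v0.1"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )

    # The instruct checkpoint ships a chat template, so prompts are built
    # the same way as for the hosted chat APIs.
    messages = [{"role": "user", "content": "Write a short product description for a mechanical keyboard."}]
    input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

    output = model.generate(input_ids, max_new_tokens=200, do_sample=True)
    # Decode only the newly generated tokens, skipping the echoed prompt.
    print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
    ```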