Doing the Lord’s work in the Devil’s basement

  • 0 Posts
  • 28 Comments
Joined 6 months ago
cake
Cake day: May 8th, 2024

help-circle




  • From reading your post it seems like you could be interested by the Jesus movement (that is the jewish followers of Jesus, before catholicism was codified and adopted by the Romans as state religion). Everything that wasn’t authoritarian fear-based catholic was branded as “gnostic heresy” and purged from the canon, but there’s some real good shit that is very close to the core message of Christ.

    A recent(-ish) example of gnostic christianity is catharism, which was a heresy that lasted for a few centuries in the South of France. They had no clergy, just a caste of ascetic wise men and women who would walk the land and dispense wisdom and judgement. Very egalitarian, very spiritual, very christ-like. As you can imagine, they got crushed in one of the rare “self-crusades” in history (meaning the King of France sent his own armies to burn down cities in his own country and murder thousands upon thousands of his own subjects). As you can imagine there is not one history teacher in France who will tell you about this episode.



  • Yeh, i did some looking up in the meantime and indeed you’re gonna have a context size issue. That’s why it’s only summarizing the last few thousand characters of the text, that’s the size of its attention.

    There are some models fine-tuned to 8K tokens context window, some even to 16K like this Mistral brew. If you have a GPU with 8G of VRAM you should be able to run it, using one of the quantized versions (Q4 or Q5 should be fine). Summarizing should still be reasonably good.

    If 16k isn’t enough for you then that’s probably not something you can perform locally. However you can still run a larger model privately in the cloud. Hugging face for example allows you to rent GPUs by the minute and run inference on them, it should just net you a few dollars. As far as i know this approach should still be compatible with Open WebUI.









  • I suspect a lot comes from the ingredients being mediocre when you buy them at high demand periods.

    I come from the French country side, my father raises poultry and makes his own foie gras and deli meats. When I see the shit they sell at Christmas, which most of my fellow countrymen eat every year… I wouldn’t be surprised reading a comment similar to yours about French Christmas food.

    Maybe your grandma can’t afford the good stuff, or doesn’t have access to it ?






  • Very useful in some contexts, but it doesn’t “learn” the way a neural network can. When you’re feeding corrections into, say, ChatGPT, you’re making small, temporary, cached adjustments to its data model, but you’re not actually teaching it anything, because by its nature, it can’t learn.

    But that’s true of all (most ?) neural networks ? Are you saying Neural Networks are not AI and that they can’t learn ?

    NNs don’t retrain while they are being used, they are trained once then they cannot learn new behaviour or correct existing behaviour. If you want to make them better you need to run them a bunch of times, collect and annotate good/bad runs, then re-train them from scratch (or fine-tune them) with this new data. Just like LLMs because LLMs are neural networks.