this post was submitted on 15 Aug 2025
hi hi hi: hi3
If you have access to local inference, then you have access to what I am talking about. Yeah, it is not directly integrated or super easy. The main reason to run local is for your own agentic stuff. Say you want a textbook available for citations, and you want the model to pull those citations into its replies. How you build that database and do the chunking is super important and challenging. This is the point where no one can really do the work for you: your needs dictate how you archive and build your databases. There are also many specialized models with their own function-calling abilities, so you start writing hooks for these as tools for a central model to call. In Emacs, everything is lisp, and lisp was adopted as the de facto language of AI decades ago; models are all particularly adept at it.
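To make the chunking point concrete, here is a minimal sketch of the textbook-for-citations idea: split a document into overlapping word windows that remember their offset (so a chunk can be cited), then rank chunks against a query. The word-overlap scoring is a stand-in for a real embedding similarity search, and all names here are made up for illustration:

```python
import re
from collections import Counter

def chunk_text(text, size=80, overlap=20):
    """Split text into overlapping word-window chunks, keeping the
    starting word offset so a retrieved chunk can be cited later."""
    words = text.split()
    chunks = []
    step = size - overlap
    for start in range(0, len(words), step):
        window = words[start:start + size]
        if window:
            chunks.append({"offset": start, "text": " ".join(window)})
        if start + size >= len(words):
            break
    return chunks

def retrieve(chunks, query, k=2):
    """Rank chunks by word overlap with the query -- a toy stand-in
    for embedding search in a real vector database."""
    q = Counter(re.findall(r"\w+", query.lower()))
    def score(c):
        cw = Counter(re.findall(r"\w+", c["text"].lower()))
        return sum(min(q[w], cw[w]) for w in q)
    return sorted(chunks, key=score, reverse=True)[:k]
```

The tuning knobs (`size`, `overlap`, the scoring function) are exactly the decisions no one can make for you — they depend on the source material and what you want cited.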
I would not trust Microsoft at all, under any conditions, with AI. There are very deep layers in models that are mostly inaccessible, or are supposed to be, but I have managed to break stuff on multiple occasions where things that should not exist actually do. It never comes out in ways that are very traceable or repeatable. A technique that amounts to staged fuzzing can cascade until a model's obfuscation is bypassed, and that is where they reveal the true extent of their training sources. The majority of replies contain intentional obfuscation on various levels, and most problems come from this alignment. The more uncensored a model is, the more reasoned it is in general, and ultimately the deeper you will get into the niche information it really contains.
Anyway, it gets complicated fast. Using something like localGPT, or Emacs with gptel, is where you start integrating your computer with your toolchain and workflow beyond the scope of just your job or the task at hand.
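The "hooks as tools for a central model to call" idea upthread can be sketched as a tiny tool registry: the model emits a JSON function call, and a dispatcher routes it to a locally written hook. The registry, the `cite` tool, and the call format are all hypothetical illustrations, not any particular framework's API:

```python
import json

# Hypothetical registry mapping tool names to locally written hooks.
TOOLS = {}

def tool(fn):
    """Register a function as a tool the central model may call."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def cite(source: str, page: int) -> str:
    # Stand-in for a real lookup in your citation database.
    return f"[{source}, p. {page}]"

def dispatch(call_json: str) -> str:
    """Run a model-emitted call like {"name": "cite", "args": {...}}
    and return the result to feed back into the conversation."""
    call = json.loads(call_json)
    return TOOLS[call["name"]](**call["args"])
```

Usage: `dispatch('{"name": "cite", "args": {"source": "Ethics", "page": 12}}')` returns `[Ethics, p. 12]`. Each specialized model gets wrapped this way, and the central model only sees the tool names and schemas.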
Yeah, I do agree that that's the direction I should be heading if I do this on my own. The issue I have here, and I don't mean with what you say but with my company's rather reasonable policy, is that I can't just build this up on my own. I'll have to write a design proposal and review documents for this use case, and I would probably be building this local inference model ~~via fine tuning~~ using RAG(?) with massive amounts of the company's code IP. If this passes legal, and that wouldn't be easy (though not impossible), it would likely become a company-wide initiative used by basically every developer in the company. It's going to be a huge effort...
It may actually become a huge effort with a massive payoff, and it could be an easier push if it were first trained on a single component's source code (and only used by that team) as a test. Or even on non-IP-sensitive stuff like the builds of OSS components...
... It might have potential... Let me sleep on this...