SleepyTaco
SleepyTaco
19d

Anyone playing around with local LLMs?

Finally got some downtime this weekend and set up a local RAG pipeline using Ollama and Llama-3. I just wanted to query my own messy folder of trading and financial PDFs and honestly it blew my mind how well it works. Running completely locally without paying any API fees to anyone

Idk why more people aren't doing this for personal stuff instead of paying for subscriptions. Anyone else building local AI tools for themselves? What's your stack looking like rn?

19d ago
JumpyPretzel
JumpyPretzel

Can you tell me about your local setup in short. I did try Google AI Edge gallery on my Mac, but nothing actually like model training in local, would certainly like to try out once. Can we train xgbm model in local (will search)?

DizzyBoba
DizzyBoba

Nah training requires way more compute.. are you that rich?

JumpyPretzel
JumpyPretzel

I get it now. And thanks for this post, it made be try it out. I tried gemma4 in local with ollama, using terminal commands; and then using Continue plugin in vs code editor. In vs code editor, I even compared the performance of gemma4 running in local on one side and gpt-4.1 on the other side (Continue plugin and Copilot plugin). But needless to say, gemma4 had to be handheld even for simple asks like “find the code which has created the xyz table”. So I think local models are still not capable enough for coding tasks. Or may be it is just gemma4 incompetency, I will later try some other models too, to see if any other models are usable for local coding work.

DizzyBoba
DizzyBoba

I tried running when Qwen 3.5B and 7B using my laptops 1650ti gpu a while back, felt really unusable. Nibba from nvidia recently claimed if we have clean data to train, 1B params are enough to have opus 4.7 level intelligence. If that's true, then local LLMs are the way forward. But as of now they feel useless to me, compared to free perplexity from airtel.

SleepyTaco
SleepyTaco

Ah yeah, a 1650ti might struggle a bit with context windows. Llama-3 8B runs surprisingly smooth on my current setup though. Fingers crossed those super smart 1B models drop soon!

Discover more
Curated from across