Well, there’s a quick check. Take a shot of whisky and I’ll see if I can type “gottle of geer”.
Here. Begin your walk down the path to a power some would call…unnatural
https://get.kiwix.org/en/solutions/applications/kiwix-server/
To LocalLLaMA@sh.itjust.works • Anyone’s using Intel Arc B70 Pro?
I really like those Tesla cards. I picked up a Tesla P4 (it’s all I can fit into my 1L shoebox) for $40 USD. Yes, you have to do some jiggery-pokery with Noctua fans and zip ties but…again…$40.
The P40s are fantastic value for what they are.
I agree.
God help me, I’m actually reading books again.
Books.
It’s…harder than it used to be. A lot harder, actually.
But there’s something to be said about marginalia etc.
Exactly so. “Mom, can we get the internet?” Mom: “We have the internet at home.”
Batteries? I don’t need batteries. I have the never-ending warm glow of weaponized autism. And that’s not even a joke.
I tend to hyper-fixate on something until either it breaks or I do. It’s usually 70/30 in my favour :)
I’m right there with you…but may I offer an alternative narrative in two parts and then address the pipeline issue you raise.
The first part:
There’s a small (but real) subset of people turning their back on big corpo. Retro-tech, dumb-phones, self-hosting, Linux, right-to-repair advocates, OSS and FOSS, privacy groups … everyone can smell the enshittification and is (in their own way) pushing back. That’s not nothing.
I think the way forward is not to play the game. Big corpo will do what big corpo always does. But we can use the tools we have to make the things we want.
Will it compete with SOTA? No. But…does it need to? At an individual level, I’d argue “probably not”. It just needs to work for the individual.
More to the point, there’s something to be said about doing more with less. Constraints can bring about real innovation. If the answer cannot be “Throw more X at it” (where X is $$$, compute, whatever)…then how can you leverage the tools and intelligence you have to build what you want? I think that’s the real question.
Now for the second part:
> So for me the big question is, what’s our call on a possible (likely even?) future where we are forever stuck using cloud provided AI along with all of its negatives, in the same way that basically all of us has been and still is stuck using MS Windows, Google and the big-social-media hellscape?
I’m more sanguine about it because I think this is down to the individual. Look at where you are now - it’s not Reddit or Facebook :). You and I choose to be here because…reasons. We can choose to run Linux, LibreOffice, Mullvad, llama.cpp, SearXNG, Syncthing, Immich etc for the same reasons.
I think the trick will be figuring out how to navigate from your home ecosystem into the wider world, without getting f’d in the a.
The one thing I don’t have a clean answer for is your pipeline point. If the content web collapses into AI slop - and it’s already going that way - then the human-generated signal that makes these models worth using starts to degrade. You may need to hold onto your “Good Old LLMs” for a while yet (or start training your own from scratch - there are ways and means, but that’s beyond the scope of this conversation, I think).
In any case, individual sovereignty doesn’t fix that. You can opt out personally and still live in a world where the epistemic commons has been strip-mined.
That’s…probably what WILL happen, come to think of it. Ok, fine. But partial answers already exist - cryptographic provenance of human content, federated communities being structurally harder to slop-flood (maybe).
Honestly? Nobody has solved that problem just yet. The people building the biggest models know it’s a problem and don’t have a clean answer either. Anyone who says they do is selling something.
All I can say is the only way to win is not to play the game. Which WOPR would no doubt meep-morp at.
The intranet becomes the internet :) Everything is local, accessible from multiple devices within my WLAN. The main box plugs into the router and serves everything over Wi-Fi to trusted devices - my documents, media, books, games etc.
I wrote (flippantly) about the bones of the system here, 3 or 4 months ago. It’s more complex now, but the endgame has always been “what if cloud, but you are your own cloud?”
https://lemmy.world/post/41315607/21438607
It may not be fresh (if the net goes down), but it would be local. The only real question I have to grok for myself is whether I want to mirror a curated section of Wikipedia, books etc.
https://en.wikipedia.org/wiki/Kiwix
Probably I should. May as well go full data-hoarder. Good excuse to get a few more TB of storage. What I’ve done so far is all within 4TB, using clever tricks and black magic, but there’s only so far 4TB can stretch. Fortunately, hard drives are still not too $$$. +4TB is about $200 here locally, so the entire setup is still around $600-700 AUD (around $350-400 USD).
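The Kiwix side is barely any work, for what it’s worth. A hedged sketch - the ZIM filename is a placeholder; check the Kiwix download index for the real snapshot names:

```python
# Hypothetical Kiwix setup: grab a ZIM snapshot once, then serve it on
# the LAN with kiwix-serve (from the kiwix-tools package). The filename
# is a placeholder - see https://download.kiwix.org/zim/ for snapshots.
import subprocess

ZIM = "wikipedia_en_all_nopic.zim"   # placeholder snapshot name

# Serve the archive to every trusted device on the WLAN.
subprocess.run(["kiwix-serve", "--port=8080", ZIM], check=True)
```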
All the other stuff I have more or less teed up (barring the UPS + solar kit I am building later in the year).
https://www.youtube.com/watch?v=1q4dUt1yK0g
Anyhow, 8TB local should just about cover what I have in mind (he said, fully aware of dragon-hoarder sickness). Then I’ll grab something for offsite storage for critical docs - I have an old Raspberry Pi with a 256GB NVMe SSD I can use for that.
I’m semi-tempted (because fuck it, why not have fun) to look into LoRa after that.
https://en.wikipedia.org/wiki/LoRa
What I really NEED to do is finish the LLM stack (I’m on it and nearly done) and then do a curated YouTube replacement with yt-dlp feeding into Nova Player or Jellyfin, once/if SmartTube etc gets shit-canned. The YouTube thing I’m kinda excited about because I’ve figured out how to squeeze ~1500 videos into around 250ish GB of storage, with TTL (time-to-live) mechanics, a download replacement schedule etc. The kids watch too much random shit on YT, so daddy will make YT at home (ha!). A rough sketch of the TTL idea is below.
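For the curious, the TTL mechanics are nothing fancy. A minimal sketch using yt-dlp’s Python API - the playlist URL, library path and 30-day TTL are placeholders, not my actual config:

```python
# Rolling YT library: pull a curated playlist at capped resolution,
# then prune anything past its time-to-live.
import time
from pathlib import Path

from yt_dlp import YoutubeDL  # pip install yt-dlp

LIBRARY = Path("/srv/media/youtube")   # hypothetical Jellyfin/Nova library
TTL_DAYS = 30                          # rolling-pool lifetime

def sync_playlist(playlist_url: str) -> None:
    """Grab new items from a curated playlist, capped at 540p."""
    opts = {
        "format": "bestvideo[height<=540]+bestaudio/best[height<=540]",
        "outtmpl": str(LIBRARY / "%(uploader)s - %(title)s.%(ext)s"),
        "download_archive": str(LIBRARY / "archive.txt"),  # skip seen IDs
        "ignoreerrors": True,
    }
    with YoutubeDL(opts) as ydl:
        ydl.download([playlist_url])

def prune_expired() -> None:
    """Delete rolling-pool videos older than TTL_DAYS."""
    cutoff = time.time() - TTL_DAYS * 86_400
    for f in LIBRARY.iterdir():
        if f.suffix in {".mp4", ".mkv", ".webm"} and f.stat().st_mtime < cutoff:
            f.unlink()

if __name__ == "__main__":
    sync_playlist("https://www.youtube.com/playlist?list=PLACEHOLDER")
    prune_expired()
```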
I have some other wild ideas too…it’s a whole other thing…don’t get me started :)
Once I’m finished, I will open-source the entire thing, post about it here, and let others replicate / improve on it. And so it goes. Once you begin walking down the dark path, you are forever doomed. Be careful :)
Hmm. The R9700 is RDNA4 - ROCm support for that architecture may be patchy on Linux? Dunno. Check that before you commit your hard-earned dollary-doos.
If all good:
- Qwen2.5-Coder-32B fits comfortably and is genuinely capable.
- Qwen3.5-27B (dense)
- Qwen3.5-35B-A3B (MoE, only 3B active parameters)
- Qwen3.6-35B-A3B just dropped
Qwen 3.6 is the latest hotness. I’d start from there and work backwards.
- How much VRAM do you have?
- Which GPU?
- What sort of coding do you want to do?
No point in telling you “yo, dude, just grab MiniMax 2.7 or GLM 5.1”…unless you happen to have several GPUs running concurrently with a total combined VRAM pool of 500GB or more.
There are strong local contenders (like Qwen3-Coder-Next), but as you can see, the table ante is probably in the 45GB VRAM range just to load them up. Actually running them with a decent context length likely means you need to be in the 80-100GB range.
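If you want to plug in your own numbers, here’s some rough napkin math. The KV-cache bytes-per-token and the runtime overhead are loose assumptions, not measured figures:

```python
# Back-of-envelope VRAM estimate for a quantized local model.
# kv_bytes_per_token is a rough guess; GQA models vary a lot.
def vram_estimate_gb(params_b: float, bits_per_weight: float,
                     ctx_len: int, kv_bytes_per_token: int = 160_000) -> float:
    weights = params_b * 1e9 * bits_per_weight / 8   # quantized weights
    kv_cache = ctx_len * kv_bytes_per_token          # grows with context
    overhead = 1.5e9                                 # buffers, very rough
    return (weights + kv_cache + overhead) / 1e9

# A 32B coder at ~4.5 bits/weight (Q4_K_M-ish) with 16k context:
print(f"{vram_estimate_gb(32, 4.5, 16_384):.1f} GB")   # ~22 GB
```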
Doable…but maybe pay $10 on OpenRouter first to test-drive them before committing to $2000+ worth of hardware upgrades.
There are other, more reasonable, less hardware dependent uses for local LLMs, but if you want fully local coders, it’s the same old story: pay to play (and that’s even if you don’t mind slow speed / overnight batch jobs).
Right now, cloud-based providers are hemorrhaging money because they know it will lead to lock-in (i.e., people will get used to what can be achieved with SOTA models, forgetting the multi-million-dollar infrastructure required to run them). Then, when they realize you can’t quite do the same with local gear (at least, not without spending $$$), they can ratchet the prices up.
Codex pro-plan just went to $300/month.
We’ve seen this playbook before, right?
Sadly…none. Well, I mean…it depends what you mean by “coding”. If you mean “replace Claude with local”, then…none. Sorry.
If you mean “actually, if I use ECA to call a cloud model from OpenRouter for planning, then have it direct a local LLM to do the scut work”, then the Qwen series of models (like Qwen 3 Next) are pretty awesome.
The iGPU will make you want to kill yourself though. Get a GPU :) Even a 4-16GB one can make a difference.
PS: You said GPU and iGPU, so I’m not sure which one has the 32GB or what rig you’re running. I have a suspicion, though, that you’re running an i5 or i7 with something like an Intel 630 iGPU built in? In which case, the iGPU is pretty slow, you won’t be able to use CUDA at all (that’s NVIDIA-only), and depending on the exact chip, Vulkan acceleration may not work either.
So, the “get a GPU” thing still holds :)
Yeah, me too :)
https://bobbyllm.github.io/llama-conductor/
https://codeberg.org/BobbyLLM/llama-conductor
I’m thinking about coding a cloud sidecar at the moment, with the exact feature you mentioned…but…that’s scope creep for what I have in mind.
Irrespective of all that, I agree: an open cloud co-op could be a good way to have SOTA (or near SOTA - GLM 5.1 is about as close as we have right now) access for when needed.
(Not teaching you to suck eggs, so this comment is for the lay-reader):
For coding, you can do some interesting stuff where the cloud model is the “general” and the locally hosted LLM is the “soldier” that does the grunt work. We have some pretty decent, consumer-level-hardware runnable “soldiers” now (I still like Qwen 3 coder)…they just don’t quite have the brains to see the full/big picture for coding.
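A minimal sketch of that pattern, assuming an OpenRouter key in the environment and a local llama-server on its default port; the model names and the task are placeholders:

```python
# "General/soldier" split: a cloud model writes the plan, a local
# llama.cpp server (OpenAI-compatible API) writes the code.
import os

import requests

CLOUD_URL = "https://openrouter.ai/api/v1/chat/completions"
LOCAL_URL = "http://localhost:8080/v1/chat/completions"   # llama-server default

def chat(url: str, model: str, prompt: str, key: str | None = None) -> str:
    headers = {"Authorization": f"Bearer {key}"} if key else {}
    resp = requests.post(url, headers=headers, timeout=300, json={
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    })
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

task = "Add retry-with-backoff to fetch_data() in client.py"

# General: the cloud model produces a terse plan...
plan = chat(CLOUD_URL, "some/cloud-planner",      # placeholder model name
            f"Write a terse implementation plan for: {task}",
            key=os.environ["OPENROUTER_API_KEY"])

# ...soldier: the local model does the grunt work.
print(chat(LOCAL_URL, "qwen3-coder", f"Implement this plan:\n\n{plan}"))
```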
I’m not sure that solves the issue or just changes the actors. Still, I’m all for “fight the power”.
I’m just a silly man with a box of scraps. But I hope enough silly men with boxes can come together to form some sort of co-op. Maybe. I don’t know. But…I hope, people smarter and better resourced than I can find a way forward.
The writing is on the wall here.
I hope you’re wrong. I’m worried that you’re probably not.
Still time. Just barely.
Hope it helps.
PS: you mentioned they had an iPhone? I’m not an Apple guy, but is there a way to share things to their iCloud from your computer?
You mean the link I gave? Limewire isn’t “limewire” any more. It’s a drag and drop sharing website.

Drop a file, give them the URL, all good. No installation required. I’ve used it a few times. Like I said, probably don’t upload state secrets there, but to share an MP4 or something, might be OK.
I dunno if I’d trust it with anything sensitive, but this might be something
To LocalLLaMA@sh.itjust.works • Honeymoon is over, baby (Codex use limits sharply cut)
Called it.

$300 fucking dollary-doos, my son. Yeah, nah.
To Open Source@lemmy.ml • Original Apollo 11 code open-sourced by NASA — original Command Module and Lunar Module code repos are now public domain resources
Three replies deep and you’ve been wrong every single time. Confused Elite with Elite Dangerous. Invented a claim nobody made. Moved the goalposts thrice. Failed to comprehend both jokes and basic geometry.
And now that you’ve run out of thread to misread, you’re resorting to ad hominems and hoping nobody scrolls up.
They will.
> How about you reread the thread instead, see that it’s about accurately reproducing existing stars, and realize that you indeed have a comprehension problem.
The sub-thread is about the minimum storage to hold a 3D model per star. Starman defined a 2-byte tetrahedron and multiplied. That’s storage math, not astrophysical reproduction.
Nobody at any point said “accurately reproducing existing stars.”
Procedural generation is relevant because it’s the canonical example of compressing astronomical-scale data into almost nothing - which is what Braben did in 1984, on the machine I cited, the one you incorrectly “corrected” me on.
You’ve now moved the goalposts twice: first from Elite to Elite Dangerous, now from “minimal storage per model” to “accurately reproducing existing stars.”
At some point it’s easier for you to just re-read the thread than to keep inventing new arguments to lose.
Go away.


Well, this is going to freak you out, because I am (literally, right now) explicitly scoping out offline YouTube integration into Jellyfin, as a sort of rolling library. Jellyfin has been good to me, but I’ve been using Nova Player for a while now, since my Pi borked itself (Nova Player is: plug hard drive into router, install app on TVs, done). The limit is that yt-dlp doesn’t integrate very well with it. I mean, I could build something, or fork the repo myself…or I could just use what already exists.
So it might be time to restore the entire *arr stack.
The TL;DR: I want one front end for ALL my media - YouTube, instructionals, movies, TV shows. That immediately speaks to Jellyfin, which I’m very familiar with. The issue is YouTube. There’s too much slop on there, I want a curated experience for the kids, SmartTube won’t work forever, and the eldest is starting to go black-hat and screw around with settings. That’s accelerating the timeline.
The stack I’m scoping: YT rolling-library logic, with the maths done at 200GB total, using compressed modern codecs:
Planning numbers per video: assume the average video is 30 mins. At 360p that’s ~100MB per video; 480p ~160MB; 540p ~220MB; 720p ~320MB.
If I have a selection of “core keepers” at 720p H.265 (~300 videos), taking up ~80GB, that leaves ~120GB for the rolling pool:
I don’t need 4K…hell, 1080p is wasted on me. So I’m thinking… 300 core vids at 720p + rolling library at 540p = 845 videos, give or take (napkin math below). More than enough to keep the fam off my back once SmartTube goes tits up (they can’t play whack-a-mole forever).
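The napkin math, for anyone checking my working (all numbers from the estimates above):

```python
# Rolling-library budget: 200GB total, ~80GB reserved for 300 core
# keepers at 720p H.265, remainder filled with ~220MB 540p videos.
BUDGET_GB = 200
CORE_GB   = 80          # ~300 "core keepers" at 720p H.265
CORE_VIDS = 300
MB_540P   = 220         # per-video estimate for the rolling pool

rolling_vids = (BUDGET_GB - CORE_GB) * 1000 // MB_540P   # 545
print(f"rolling pool: {rolling_vids} vids, total: {CORE_VIDS + rolling_vids}")
# -> rolling pool: 545 vids, total: 845
```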
I would prefer a clean migration to other, live sources (I have those scoped out as well) but not all the Minecraft / gaming / pretend play / blah blah stuff the family watches is on Peertube/Odysee/Curiosity Stream.
PS: I see your 480p and raise you 60, because 540p is the forbidden resolution :)
PPS: I was planning on using JF for music too…but maybe I should look at Navidrome like you said.
The crazy idea that I had was to use AI to create an infinite playlist of sorts. Seed it with your own music, get it to generate tracks in THAT style as filler, intermingle them (so there’s always something new).
Finish off with an AI DJ that pulls in “local news” from your curated RSS feeds.
Think: Three Dog from Fallout 3.
Basically what I spoke about here -
https://lemmy.world/post/43936980/22784324
I have a pretty clear idea of how to get that done. It could be amusing.
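The moving pieces, as a toy sketch. The RSS bit is real (feedparser); the generated-filler entry is a placeholder for whatever music-gen stack ends up doing the work (the ACE-Step model linked below is one candidate), and a real version would also TTS the DJ script:

```python
# Toy sketch of the infinite-playlist idea: interleave seed tracks with
# generated filler and periodic DJ breaks built from curated RSS headlines.
import random

import feedparser  # pip install feedparser

def dj_break(feed_url: str) -> str:
    """Build a Three-Dog-style news break from the top RSS headline."""
    entry = feedparser.parse(feed_url).entries[0]
    return f"This is Home Radio. In the news today: {entry.title}."

def build_playlist(seeds: list[str], feed_url: str, length: int = 12) -> list[str]:
    playlist = []
    for i in range(length):
        if i % 4 == 3:                      # every 4th slot: DJ break
            playlist.append(f"[DJ] {dj_break(feed_url)}")
        elif i % 2 == 0:                    # alternate your own tracks...
            playlist.append(random.choice(seeds))
        else:                               # ...with generated filler
            playlist.append("[GEN] new-track-in-seed-style.wav")  # placeholder
    return playlist

for slot in build_playlist(["track_a.flac", "track_b.flac"],
                           "https://example.com/local-news.rss"):
    print(slot)
```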
https://huggingface.co/ACE-Step/acestep-5Hz-lm-0.6B