I think this definitely seems interesting. I'd love to give it a try if you're still working on it.
Sent it to you in a DM because of all the downvotes here
Nobody's going to know, you haven't described what it actually does.
Sorry for the oversight! It's basically an AI radio app (with stations playing different music styles). There's even an AI DJ feature that's audience-aware (think weather callouts for listener locations, audience polls, etc.)
Don't mind the downvotes, they're from people who don't even use this community.
Yeah I'm not disheartened, my mom said it was cool /s
In all seriousness, my wife and I think it's cool and I literally use it (all day) during the week while I'm working
If you're going to do lyrics don't do all the stations, leave some just to play the music.
Yup, I have instrumental only stations (and they're tagged as such so you can filter by those only)
You might not even know how to answer this, but can it use accelerators to do the generation?
So I'm using an open source model that runs on a 3090. I tried to make the app itself as agnostic as possible so I can just plug any API-compatible server into each component (track metadata generation, actual song generation, AI DJ script and so on). I figured making this thing as flexible as possible would be best overall.
Edit: the model is ACE-Step 1.5
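For anyone curious what "plug any API-compatible server into each component" could look like, here is a minimal sketch. It is not SynapseFM's actual code: the component names, URLs, model names and the `endpoint_for` helper are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Endpoint:
    base_url: str  # any API-compatible server
    model: str     # model name to request from that server


# Hypothetical defaults: names, ports and models are placeholders.
COMPONENTS: Dict[str, Endpoint] = {
    "track_metadata": Endpoint("http://localhost:8001/v1", "metadata-llm"),
    "song_generation": Endpoint("http://localhost:8002", "ace-step-1.5"),
    "dj_script": Endpoint("http://localhost:8001/v1", "dj-llm"),
}


def endpoint_for(component: str,
                 overrides: Optional[Dict[str, Endpoint]] = None) -> Endpoint:
    """Return the server for a component; overrides swap out the default."""
    merged = {**COMPONENTS, **(overrides or {})}
    if component not in merged:
        raise KeyError(f"no endpoint configured for {component!r}")
    return merged[component]
```

Each component only ever asks `endpoint_for(...)` where to send its request, so moving one piece to a different box or model is a config change rather than a code change.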
No.
You know, before I read the post I was thinking of news or podcasts created with AI. But yeah, music would be cool too.
Sounds interesting! Yes, please post it
Sent it to you in a DM because of all the downvotes here 🤕
Cheers! I will take a look. Weird how hostile Lemmy is to AI - especially LocalLLaMA. Think you got brigaded.
EDIT: Holy shit dude - that's amazing. Well done!
Thanks, I really appreciate it! Not all the stations have an AI DJ btw... I should probably add some visual cue in the station browser which shows whether or not one is set. Also, some of the AI DJ actions don't trigger if there aren't enough people listening to a station (like polls, for example)
Yeah, I just caught the tail end of a DJ announcement on the Island Vibes station. This is a great idea...but you will get murdered by the Lemmy / Reddit "this is AI slop" hivemind. I suspect those people haven't turned on the radio in their car any time recently; I'd rather listen to this tbh
PS: submitted a request for something like ambient LoFi girl.
PPS: if you've got AI DJs...can we expect AI podcasts or short segments at some stage? Lean into the whole Three Dog (Fallout 3) vibe.
Saw your request and songs are queued for generation! The new station is called Cloud 9 Chill FM. So it supports station jingles, but the intent there was just "you're listening to so-and-so station" announcements (same as in regular radio).
I'm not sure about AI podcasts though... maybe like a news station would be useful
Yes, it would be useful, I think. You could, for example, source something from an RSS feed to turn into a newscast - just 2-3 items - as part of the station's jingle support. You'd have to maybe ask Claude etc. for some ideas (perhaps pulling different RSS feeds to match the station? The synthwave one might pull in Ars Technica or something).
I'll keep an eye out for Cloud 9 Chill.
Is there a blog or some such you use to discuss the architecture of SynapseFM? Would be curious to know more.
Yeah, that's actually a tool I was already working on creating for the AI DJ. Basically it pulls current news info based on audience location
Do it :) It would add a lot, I think. Though it introduces some complexity on your end if you have to geo-tag canonical feeds per user, per each location, to extract from; a few set ones (technology, science, world news etc) per each station might be easier...but then have the DJ announce in the voice of whatever that IP address is from?
Dunno. You're clearly more than capable of working it out, so I look forward to seeing what you do.
OK so, I'm almost done with the tool for the AI DJ. Basically we're using SearXNG to search for audience locale news + using RSS feeds to provide the LLM with context about specific news categories (Technology, for example)
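The RSS half of that idea (grab 2-3 items from a feed and hand them to the LLM as context) can be sketched roughly like this. It assumes a plain RSS 2.0 feed with `<item><title>` elements; the function names and the prompt wording are invented for the example and are not from the actual tool. Atom feeds use `<entry>` and namespaces, so they would need different handling.

```python
import xml.etree.ElementTree as ET
from typing import List


def top_headlines(rss_xml: str, limit: int = 3) -> List[str]:
    """Return up to `limit` non-empty item titles from an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    titles: List[str] = []
    for item in root.iter("item"):
        title = (item.findtext("title") or "").strip()
        if title:
            titles.append(title)
        if len(titles) == limit:
            break
    return titles


def newscast_prompt(station: str, headlines: List[str]) -> str:
    """Build a hypothetical DJ prompt that lists the chosen headlines."""
    bullets = "\n".join(f"- {h}" for h in headlines)
    return (
        f"You are the DJ for {station}. "
        f"Read a short news break covering:\n{bullets}"
    )
```

Fetching the feed itself is left out on purpose; anything that returns the XML as a string (for example `urllib.request.urlopen(url).read().decode()`) would slot in before `top_headlines`.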
Nice! Sing out (ha ha) when it's done so we can try it.
You can definitely give it a shot now, it's done at this point! Keep in mind the LLM can pick and choose between the various tools to call, so news may not be called out each time
Nice :) I have it on right now. Might need a touch more reverb, though that could just be the track (“Silence Between Thunder and Lightning”). Definitely in the ballpark. Cheers for that.
I had an idea for you driving home, though it may introduce scope creep.
Have you considered a hybrid station mode where the user can supply their own music library and Synapse FM intermingles it with the generated tracks? For example, maybe the user uploads a playlist manifest plus files or points Synapse at a Google Drive / Dropbox folder containing MP3s and an .m3u playlist. Then the system could:
- randomly select from the user’s own tracks
- blend them into the generated station flow
- use metadata / embeddings / simple tagging to keep tonal consistency
- optionally let the AI DJ introduce those tracks as part of the same station identity
So instead of pure AI radio, it becomes something closer to:
“your own music taste, extended infinitely”
That feels like a pretty compelling hook to me...and might actually protect you from the haters.
Set it up so tracks are either played directly, or used as “station DNA” for selection / matching / transitions. Or both.
Or (and this is my preference) you could have it so that the scheduler inserts user tracks every N songs.
You could even allow users to tip the balance, user side:
- AI only
- Mostly AI
- Balanced
- Mostly my library
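To make the "insert user tracks every N songs" idea and the balance settings above concrete, here is a rough sketch. The mode names, the slot patterns and `station_flow` are all assumptions made up for illustration; a real scheduler would also need the tonal matching and DJ intros mentioned earlier.

```python
import random
from itertools import count, cycle, islice
from typing import Iterable, Iterator, Optional, Sequence

# Hypothetical slot patterns: "A" = generated track, "U" = user-library track.
PATTERNS = {
    "ai_only": "A",
    "mostly_ai": "AAAU",       # a user track every 4th song
    "balanced": "AU",
    "mostly_library": "UUUA",  # a generated track every 4th song
}


def station_flow(mode: str,
                 ai_tracks: Iterable[str],
                 user_tracks: Sequence[str],
                 rng: Optional[random.Random] = None) -> Iterator[str]:
    """Yield track names, mixing random user tracks into the generated stream."""
    rng = rng or random.Random()
    ai_iter = iter(ai_tracks)
    for slot in cycle(PATTERNS[mode]):
        if slot == "U" and user_tracks:
            yield rng.choice(user_tracks)
        else:
            # Fall back to a generated track if the user library is empty.
            try:
                yield next(ai_iter)
            except StopIteration:
                return  # generated stream ran dry, so stop the station


if __name__ == "__main__":
    generated = (f"ai-{i}" for i in count())
    for track in islice(station_flow("mostly_ai", generated, ["mine.mp3"]), 8):
        print(track)
```

Using a repeating pattern keeps the ratio exact and predictable; a weighted random pick per slot would feel less mechanical but could produce long runs from one source.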
I'm handwaving away a lot here but even as a local/private beta feature for you alone, it seems like a genuinely interesting direction.
Again - scope creep / you might see it differently than I do. Still, even if you just play with it at home, try it and see if the idea works.
Just wanted to share, one journeyman to another.
It's a good project and you SHOULD post the URL here (I won't / am respecting your privacy).
Be proud of it, it's good work.
EDIT: Just caught the jingle between songs - well done! Exactly right.
I'd love to hear what local model you settle on for lyrics, I've been having a lot of fun with ACE-Step 1.5 but the lyric generator it's bundled with produces semi-nonsense lyrics that have nothing to do with what I prompt it with. Which is actually kind of fun in its own way, I literally never know what the song's going to be about, but I'd like a little control sometimes too. :)
Yeah I completely agree on the lyrics it can generate! That being said I actually haven't tried using the 4b lm model version just yet. I should probably give that a shot...
It's the one I use, so don't expect miraculous improvement. :)
I'm curious, do you have ADG enabled at all? How many steps do you use generally?
ADG == Audio-Driven Guidance? I haven't played around with that part much. I tried it out and couldn't get it to work, but it turned out that the reason ACE-Step wasn't working was unrelated to that, and I only figured out what was wrong after I stopped experimenting with ADG. So I haven't gone back to try it again.
I'm not really much of a music connoisseur, I just know what I like when I hear it. So mostly I just put together lyrics and then throw them at the wall to see what sounds good. :)
ADG for ACE-Step is Adaptive Dual Guidance. Try turning that on + make sure you have thinking enabled. I saw a big difference with both of those on (more so thinking)
Ah. After poking around in the Gradio UI a bit, I found an "Enable ADG" option, but the tooltip says it's "Angle Domain Guidance". Same thing?
I'm a programmer, but sometimes with AI I feel like a primitive tribesperson blindly attempting various rituals in an effort to appease the machine spirits. Eventually something works, and then I just keep on doing that.
Edit: I have angered the gods! My ritual failed! When I enabled ADG the spirits smote me with the following:
RuntimeError: The size of tensor a (11400) must match the size of tensor b (5700) at non-singleton dimension 1
Guess I won't be trying that for now. :)
naw, man.