• 3 Posts
  • 53 Comments
Joined 1 month ago
Cake day: February 11th, 2026


  • Yes, and it’s actually pretty good at it. The code won’t be the most efficient, it won’t be elegant or beautiful… but it will mostly work, and someone with technical experience can get it over the line.

    Case in point: I can “sort of” code, but my career has been spent writing simple scripts - nothing more complicated than workstation provisioning, find-and-replace with some regex, PowerShell with a WinForms GUI, etc. Despite being at a relatively low level when it comes to actually building applications, I’ve been able to “project manage” and hand-edit Claude output into a working application. It’s basically just a frontend for FFMPEG, with some smarts and automation built in. Not particularly impressive in absolute terms, but it’s a lot snappier and prettier than anything else I’ve ever put together, and I’m proud of it.

    I got it from concept to working in a few days, and added major features plus a few efficiency passes and bug fixes in two weeks - an absolutely incredible pace.

    This comment is going to get absolutely nuked with downvotes, I guarantee it - but that won’t change the fact that I’m successfully building stuff with AI.


  • I spent quite a lot of effort getting Stoat up and running, because they aren’t working on the self-hosted version, only to get a nice email from the German government telling me my server was running an outdated version of React with RCE vulnerabilities. Nuked that stack at 3am.

    Also, I fixed their Tenor integration to be provider-agnostic so self-hosters could choose a different gif provider like klipy (Tenor turned off their API, so gif search in Stoat is broken). I tried to contribute that one small change back to the main project, and it was immediately rejected because “we have no plans for klipy support”.
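    The change itself wasn’t anything fancy - just putting gif search behind a small provider interface so the admin picks the backend in config. Something shaped roughly like this (a Python sketch purely for illustration; it is not Stoat’s actual code, config keys, or either provider’s real API):

```python
# Minimal sketch of a provider-agnostic gif search (illustrative only - not
# Stoat's actual code, config keys, or either provider's real API).
from abc import ABC, abstractmethod

class GifProvider(ABC):
    """Anything that can turn a search query into a list of gif URLs."""
    @abstractmethod
    def search(self, query: str, limit: int = 20) -> list[str]: ...

class TenorProvider(GifProvider):
    def __init__(self, api_key: str):
        self.api_key = api_key
    def search(self, query: str, limit: int = 20) -> list[str]:
        return []  # call Tenor's search endpoint here

class KlipyProvider(GifProvider):
    def __init__(self, api_key: str):
        self.api_key = api_key
    def search(self, query: str, limit: int = 20) -> list[str]:
        return []  # call klipy's search endpoint here

# The instance admin picks the provider in config instead of Tenor being hard-coded.
PROVIDERS = {"tenor": TenorProvider, "klipy": KlipyProvider}

def provider_from_config(name: str, api_key: str) -> GifProvider:
    return PROVIDERS[name](api_key)
```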

    Not worth the effort, IMO.



  • I’m right there with you, bud. I tried StoatChat too, and I got a nice email from the German government about using an outdated version of React with RCE vulnerabilities. I think this must be a very difficult problem to properly solve, given the number of different approaches and how all of them have their own issues to contend with. Nextcloud Talk is the most usable option I’ve found because it does voice, video, and screen sharing and it also has call links you can send for unregistered people to join the calls. But performance is spotty even with the “high performance backend” set up (that may be due to my server being in Germany though 😅).

    As to being accused of using AI, don’t let it get to you. The people yelling the loudest can’t tell the difference between handwritten code and AI code, because they can’t code. If you pull your repo down, you’ll be depriving the people who might actually use your project, all because of trolls who never would have tried it in the first place.

    I do use AI for coding, and I’ve gotten plenty of hate for it, but also people who don’t care and just want the functionality of the tool I built.

    And in fact I’m going to check out your project and see if I can get it up and running, so please don’t take it down. I’ll likely be putting it on my German server so I’ll let you know what the performance is like with extreme round trips 😁


  • Would be easier to contribute to XMPP or Matrix IMO.

    Synapse is in the middle of a rebuild without much compatibility between the legacy and new builds, and it’s a pain in the dick to set up at the moment. I know, because I did it.

    XMPP I haven’t tried to set up yet, but I imagine it to be similarly in-depth.

    As to why they didn’t contribute to an existing project (edit: it’s not AI) - they just don’t have the confidence in their own skills to contribute to anyone else’s project.

    Now… why do the whole thing from scratch instead of forking? Great question. XMPP might just need a nice coat of paint, if it can handle voice, video, and screen share; I haven’t come away with great impressions of Matrix/Synapse.




  • “When it comes to the usage of both words, that difference you listed is completely arbitrary and obviously irrelevant”

    What? No. Software is something people go looking for and choose to download, unless we’re talking about malware which I think is fair to say is obviously outside the bounds of this conversation. Spam emails are forced on people without their asking or looking for them. They’re not at all interchangeable or the same thing.

    Most people don’t care how their software is written, just like they don’t care how their food is actually made. And by “most people” I don’t mean you or anyone else here on Lemmy, I mean the majority of people who use computers. You wouldn’t believe how technically illiterate and uncurious the average person is - that’s who I mean. Those people hate spam emails, but they don’t care if their email app was vibecoded with AI. They don’t even know the difference between AI code and hand-typed human code, and most of 'em probably think “more code is better so AI is better!”.

    “Unless you’re trying to argue something else; that the slop in this specific case is more justified.”

    Sort of. I’m saying that while I understand why AI disclosures are a good thing, I think that if a person is not paying for an application and they’re not contributing to its development, then that person can keep their opinions on the development process to themselves. They can take those opinions and go build something of their own to satisfy them.

    “it’s the eagerness to treat users as braindead trash undeserving transparency.”

    I simply don’t think that’s a fair characterisation, because it ignores how people treat the developers who use the tools in the first place. People who have no technical skills whatsoever are happy to loudly shit all over said developers and call their work garbage - work they’ve been doing for nothing.

    I agree the initial response could have been approached better, but all of us have the benefit of judging in hindsight and from a distance. I can understand how their emotions got the better of them, while under fire like that. This looks distinctly different from the BookLore fiasco though, where the dev is trying to close up the source in retaliation.

    I just wish people would find more reasonable targets for their ire, instead of rolling with the pitchforks-and-torches mentality. Individuals building open source software are not usually reasonable targets. I do think “good thing it’s easy to fork open source” is the right sentiment; this is why anything I build, I put up under the Unlicense, because as far as I’m concerned any utility someone can get from it is to the good.



  • Yeah, I’m getting that; though this isn’t purely AI-generated. This is a working application that I’ve tested, have improved and plan on continuing to improve, and am currently using to transcode my media. There’s a lot more care and thought put into it than most people would expect on reading that it was created with the help of an AI model.

    I put the disclaimer there because I respect that serious developers who actually go look at the code would like a heads-up that it’s genAI before they spend time reading it. But I would also like people to at least have a chance to read why I think my approach is different from most.

    And, if you have videos to transcode, I’d love to hear what you think if you give it a go! I do actively fix bugs as well as add new features, so please do let me know if you try it and find an issue - I could use all the help testing it I can get 'cause my hardware to test on is quite limited.


  • “I was hoping to catch this before you replied, as I went and read the readme and then it made more sense. So I deleted my reply. But too late!”

    All good! I’m actually enjoying talking about this thing with people who want to know more, so I don’t mind at all.

    “The cool thing is there isn’t much to put into a command that does stuff like this, unless you’re changing the FFMPEG parameters every time, but that would seem unlikely.”

    So actually, that’s exactly the issue I was running into! I’d run a batch command on a whole folder of videos, but a handful would already be well-encoded, or at least have a much, MUCH lower bitrate, so I’d end up with mostly well-compressed files and a handful that looked like they went through a woodchipper. I wanted everything to come out the other end in the same codec, in the same container, at roughly the same quality (and playable on devices from around 2016 and newer), so I implemented a three-way decision based on the target bitrate you set. Every file gets evaluated independently to pick an approach (rough sketch after the list):

    1. Above target → VBR re-encode: If a file’s source bitrate is higher than the target (e.g. source is 8 Mbps and target is 4 Mbps), the video is re-encoded using variable bitrate mode aimed at the target, with a peak cap set to 150% of the target. This is the only case where the file actually gets compressed.

    2. At or below target, same codec → stream copy: If the file is already at or below the target bitrate and it’s already in the target codec (e.g. it’s HEVC and you’re encoding to HEVC), the video stream is copied bit-for-bit with -c:v copy. No re-encoding happens at all - the video passes through untouched. This is what prevents overcompression of files that are already well-compressed.

    3. At or below target, different codec → quality-mode transcode: If the file is at or below the target but in a different codec (e.g. it’s H.264 and you’re encoding to HEVC), it can’t be copied because the codec needs to change. In this case it’s transcoded using either CQP (constant quantisation parameter) or CRF (constant rate factor) rather than VBR - so the encoder targets a quality level rather than a bitrate. This avoids the situation where VBR would try to force a 2 Mbps file “down” to a 4 Mbps target and potentially bloat it, or where the encoder wastes bits trying to hit a target that’s higher than what the content needs.
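    For the curious, the per-file video decision boils down to something like this - a rough Python sketch purely for illustration, not the actual HISTV source, and all the names are made up:

```python
def pick_video_args(src_kbps, src_codec, target_kbps, target_codec,
                    encoder="libx265", crf=22):
    """Decide how to handle one file's video stream (illustrative sketch only)."""
    if src_kbps > target_kbps:
        # 1. Above target -> VBR re-encode aimed at the target, peak capped at 150%.
        maxrate = int(target_kbps * 1.5)
        return ["-c:v", encoder,
                "-b:v", f"{target_kbps}k",
                "-maxrate", f"{maxrate}k",
                "-bufsize", f"{2 * maxrate}k"]
    if src_codec == target_codec:
        # 2. At/below target, same codec -> copy the stream bit-for-bit.
        return ["-c:v", "copy"]
    # 3. At/below target, different codec -> quality-mode transcode (CRF here; CQP
    #    would be the hardware-encoder equivalent), so the encoder chases a quality
    #    level instead of bloating the file up to the target bitrate.
    return ["-c:v", encoder, "-crf", str(crf)]
```

    (The 2× maxrate bufsize is just a typical choice for the sketch, not anything HISTV-specific.)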

    There’s also a post-encode size check as a safety net: if the output file ends up larger than the source (which can happen when a quality-mode transcode expands a very efficiently compressed source), HISTV deletes the output, remuxes the original source into the target container instead, and logs a warning. So even in the worst case, you never end up with a file bigger than the one you started with, which is much harder to guarantee with a raw CLI invocation. The audio side takes a similar approach: each audio stream is independently compared against the audio cap, and streams already below the cap in the target codec are copied rather than re-encoded.
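    That safety net is basically just a size comparison at the end - again, a rough sketch with made-up names, assuming ffmpeg is on the PATH, not the actual HISTV code:

```python
import os
import subprocess

def finalize(src_path: str, out_path: str, container_ext: str = ".mkv") -> str:
    """Keep the encode only if it's actually smaller; otherwise fall back to a remux."""
    if os.path.getsize(out_path) <= os.path.getsize(src_path):
        return out_path
    # The encode grew - throw it away and remux the original into the target container.
    os.remove(out_path)
    remuxed = os.path.splitext(out_path)[0] + ".remux" + container_ext
    subprocess.run(["ffmpeg", "-y", "-i", src_path, "-c", "copy", remuxed], check=True)
    print(f"WARNING: encode of {src_path} came out larger than the source; remuxed instead")
    return remuxed
```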

    But yeah everything beyond that was bells and whistles to make it easier for people who aren’t me to use it haha.

    I am 100% looking for more stuff I can build - let’s talk about it!