These tests are using quantizations of whisper-large-v3-turbo from
here. These are all one-off tests on the audio from this
Summary of 2025's legal battle
View attachment 8405031
I posted this already in the Legal thread but then I realized they probably wouldn't allow any discussion about it like you can have here.
FYI this is why my annual update is taking so long, I have pretty much every other aspect of Russell's antics logged and voiced, it's this court shit that goes over my head most times.
Obviously huge thanks to everyone linking the Pacer files, this is pretty much all you.
I was able to get my ARC card up and running without too much fuss. I've attached the outputs of all of them (they are srt format but I had to rename them txt because the site doesn't accept .srt files). I watched through one of the Q8 ones and it seemed basically correct. For all tests memory usage was essentially constant.
Q8_0
Tesla M40: 56s, 1.5gb, 120w
Ancient Xeons: 247s, 2.1gb, probably 400w or some shit
Arc A380: 94s, 1.3gb, ??*
Q4_k
Tesla M40: 49s, 1.7gb, 120w
Ancient
Aliens Xeons: 150s, 2.1gb, probably 400w or some shit
Arc A380: 81s, 875mb, ??*
Q4_0
Tesla M40: 38s, 1.7gb, 120w
Ancient Xeons: 111s, 2.1gb, probably 400w or some shit
Arc A380: 80s, 875mb, ??*
*intel gpu top doesn't report wattage for my card. It doesn't require an extra power connector so per to the PCIe spec it must be under 75w.
I don't have REBAR enabled but I don't think that will affect anything after the model is loaded. Out of interest I ran a test with the Q4_0 on a basic bitch 4c/4t skylake cpu and it took 560s. I also tried some much longer files and it didnt' seem to change the memory usage.
I'm pretty surprised that the low tier A series cards can do this. On top of that whisper.cpp is running on them via vulkan and in intel-gpu-top there's separate bars for "render/3d", "computer" and "video". I tried running a transcoding job at the same time and it seems like that's under "video" (and obviously vulkan is under "render/3d") so had no effect.
All in all it seems like dual AV1 encoding + whisper captioning seems feasible on ARC cards and if
@geckogoy is willing to send you one for free to test with I would take him up on that offer.