From the details that Kson has revealed in her streams and WIP posts, it looks like she has chosen the Unreal Engine +
XSens consumer mocap stack which is the same setup that CodeMiko uses.
The tracking quality that these enormously expensive suits can deliver is as high as it gets for consumer grade hardware but integrating it into something you can stream is not anywhere near as easy or as cheap as it is with the most common vtuber full body mocap stack which is built on VR tech instead of specialized mocap tech (Unity + VRM + Vive Trackers stack that Cover and Anycolor both use).
I don't want to hype it up too much because in the end, Kson is still just an independent vtuber but the amount of money and resources she's pouring into this makes me feel like this is just a initial investment into something bigger like a 3D studio. It doesn't make sense to order expensive hardware and hire programmers to build software for just one person.
At the very least I'd expect face and eye tracking which is something neither Cover nor Anycolor have been able to do* for their 3D events yet.
Cost breakdown
XSens Link suit: $15k
Manus gloves: $6k
Some type of face rig hardware: at least $1k not including the camera itself
Computer hardware: $lodsofemone
Studio space: $lodsofemone
Army of assistant directors and staff to make sure everything goes as planned: $lodsofemone
Kson either bought a studio or is working with an established one. Either way, it's cool to see some return on that gachikoi dosh.
---
* Pure speculation on my part, but I believe the reason why big corpos are still on last-gen VR centric 3D tracking is more of a cost issue than anything else. There are indies that have fully functional face tracking but it's very finicky to work with and the more performers you add simultaneously, the more complicated and expensive things get. It's one thing to get eight girls dancing in real time while wearing VR headsets and body trackers (maybe a couple thousand dollars for each person) but another to get eight girls wearing motion capture rigs doing the same, especially when mocap tech isn't really designed to scale like that.
Then you have the cost of hiring experienced Unreal developers and experienced motion capture experts to write software, then add the cost of having to maintain highly specialized hardware that fucking breaks all the time.
In this ancient video from 2018, you can actually see that Cover can actually use head mounted Iphones for 3D face capture, but seems to have chosen not to, probably because dancing with a front heavy helmet on your head is impractical.