WIP: This is meant to be a reference block implementation for various RWKV modules. It is not yet considered complete.
- RWKV states are intentionally passed around as native `tuple` and `list` objects; benchmarking shows this gives a measurable speedup compared to wrapping them in state data classes.
- Init state tuning weights should be made available via `init_state.x.wkv`.
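
As a rough illustration of the tuple/list convention described above, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration: the names `LayerState`, `toy_block_forward`, `toy_model_forward`, and `init_states` are hypothetical, the state layout (a token-shift state plus a wkv state per layer) is assumed, and the arithmetic is a toy stand-in rather than the actual RWKV computation.

```python
from typing import List, Tuple

# Hypothetical per-layer state layout: (token-shift state, wkv state).
# Both are plain lists of floats here; a real module would hold tensors.
LayerState = Tuple[List[float], List[float]]


def init_states(n_layer: int, dim: int) -> List[LayerState]:
    """Build a zeroed state: a native list with one native tuple per layer."""
    return [([0.0] * dim, [0.0] * dim) for _ in range(n_layer)]


def toy_block_forward(
    x: List[float], state: LayerState
) -> Tuple[List[float], LayerState]:
    """Toy stand-in for a block forward pass (NOT the real RWKV math)."""
    shift_state, wkv_state = state  # plain tuple unpacking, no wrapper class
    # Blend the current input with the previous one (stand-in for token shift).
    mixed = [0.5 * a + 0.5 * b for a, b in zip(x, shift_state)]
    # Decay and accumulate (stand-in for the wkv recurrence).
    new_wkv = [0.9 * w + m for w, m in zip(wkv_state, mixed)]
    # Return the output together with a fresh state tuple.
    return new_wkv, (x, new_wkv)


def toy_model_forward(
    x: List[float], states: List[LayerState]
) -> Tuple[List[float], List[LayerState]]:
    """Thread the input through every layer, collecting new states in a list."""
    new_states: List[LayerState] = []
    for layer_state in states:
        x, layer_state = toy_block_forward(x, layer_state)
        new_states.append(layer_state)
    return x, new_states


# Usage: the caller keeps the returned list and passes it back in next step.
states = init_states(n_layer=2, dim=3)
out, states = toy_model_forward([1.0, 2.0, 3.0], states)
```

The point of the sketch is only the calling convention: each block takes and returns a bare tuple, and the model threads a bare list of those tuples, so no state wrapper object is constructed or inspected on the hot path.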