Research
benchmark
AomiBench: Benchmarking Frontier Models on Onchain Execution
A wallet-aware harness for measuring frontier agents on real onchain tasks, simulations, and chain-state evidence.
A wallet-aware harness for measuring frontier agents on real onchain tasks, simulations, and chain-state evidence.