Visit our project page to view more cases. Hallo3 has a few simple requirements for the input data of inference: Reference image must be 1:1 or 3:2 aspect ratio. Driving audio must be in WAV format.
This repository contains minimal code and resources for inference using the Kokoro-82M model. The repository supports inference using ONNX Runtime and uses optimized ONNX weights for inference.