Generative Pre-trained Auto-regressive Diffusion Transformer (GPDiT)
GPDiT (Generative Pre-trained Auto-regressive Diffusion Transformer) combines diffusion modeling with transformer architecture for powerful video recoloring. Operating in latent space with a parameter-free rotation-based time conditioning mechanism and lightweight causal attention, it enables remarkable few-shot learning capabilities. This breakthrough model generates temporally consistent, high-quality colorized videos from grayscale inputs with minimal examples needed for adaptation to specific styles.