Fusion Segment Transformer for AI Generated Music Detection
Fusion Segment Transformer: Bi-directional attention guided fusion network for AI Generated Music Detection
Authors: Yumin Kim and Seonghyeon Go
Submitted to ICASSP 2026. Detects AI-generated music by modeling full audio segments with content-structure fusion.
⚠️ Note: In the ZeroGPU environment, processing may take about 30 seconds per audio file.