[2401.11237] Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View