[2404.09696] Are Large Language Models Reliable Argument Quality Annotators?