Too Long; Didn't Read
This section evaluates the effectiveness of five Large Language Models in automatically identifying inclusiveness-related user feedback across Reddit, Google Play Store, and Twitter. BART excels on Twitter, while BERT leads for the Play Store. However, Reddit's intricate discussions prove challenging, with GPT-2 emerging as the best performer. The findings shed light on the varied efficacy of LLMs in different contexts, offering valuable insights for developers seeking automated approaches to comprehend user sentiments.