Teses não defendidas no IFAL
URI Permanente para esta coleção
Navegar
Navegando Teses não defendidas no IFAL por Orientador "Moura, José Antão Beltrão Moura"
Agora exibindo 1 - 1 de 1
Resultados por página
Opções de Ordenação
Item Estimativa de esforço em story point a partir do texto da user story com aprendizagem de máquina e LLM(UNIVERSIDADE FEDERAL DE CAMPINA GRANDE - UFCG, 2025-09-16) Néo, Giseldo da Silva; Moura, José Antão Beltrão Moura; http://lattes.cnpq.br/3572375884177951; Morais, Fábio Jorge Almeida; http://lattes.cnpq.br/0987042606840444; Costa, Evandro de Barros; https://orcid.org/0000-0003-4663-8715; http://lattes.cnpq.br/5760364940162939; Vieira, Thales Miranda de Almeida; https://orcid.org/0000-0001-7775-5258; http://lattes.cnpq.br/8181104476035846; Bezerra, Tarcio Rodrigues; http://lattes.cnpq.br/5285201763618981Effort estimation in agile software projects remains a persistent challenge in the industry, especially when using textual artifacts such as User Stories to predict Story Points. This thesis investigates the use of Natural Language Processing (NLP) and Machine Learning (ML) techniques in effort prediction, considering the textual description of User Stories as the main source of information. Initially, a systematic literature review identified prevalent techniques for the said estimation, such as Term Frequency – Inverse Document Frequency (TF-IDF) combined with Support Vector Machine (SVM), and highlighted gaps related to the use of readability, sentiment, and subjectivity attributes, as well as the sacant of application of Large-Scale Language Models (LLMs) for this task. The research proposed and evaluated three main approaches: (i) the Neo Legibility Effort Model, which uses attributes automatically extracted from User Story text to predict effort; (ii) the Neo User Story Tutor, an LLM-based application that suggests improvements in User Story writing to improve estimation accuracy; and (iii) the Neo LLM Predictor, which uses LLMs to directly estimate Story Points using different strategies (few-shot, zero-shot, and fine-tuning). To support the experiments a new dataset collected from real projects hosted on GitLab, was built (aka NeoDataset). The proposed models were evaluated using metrics such as MAE and compared with established baselines in the literature. The results demonstrated that both readability attributes and LLMs can significantly contribute to improving effort estimates in agile environments. The thesis presents evidence that it is possible to increase estimate accuracy through the combination of textual analysis and machine learning, in addition to highlighting the relevance of linguistic aspects in the quality of User Stories.