Large Language Models

Metamorphic-Based Many-Objective Distillation of LLMs for Code-related Tasks

Knowledge distillation compresses large language models (LLMs) into more compact and efficient versions that achieve similar accuracy …

Annibale Panichella

Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation

Azat Abdullin, Pouria Derakhshanfar, Annibale Panichella

TestSpark: IntelliJ IDEA’s Ultimate Test Generation Companion

Abstract: Writing software tests is laborious and time-consuming. To address this, prior studies introduced various automated test-generation techniques. A well-explored research direction in this field is unit test generation, wherein artificial intelligence (AI) techniques create tests for a method/class under test.

A. Sapozhnikov, M. Olsthoorn, V.V. Kovalenko, A. Panichella, P. Derakhshanfar

Breaking the Silence: the Threats of Using LLMs in Software Engineering

Large Language Models (LLMs) have gained considerable traction within the Software Engineering (SE) community, impacting various SE tasks from code completion to test generation, from program repair to code summarization. Despite their promise, researchers must still be careful as numerous intricate factors can influence the outcomes of experiments involving LLMs. This paper initiates an open discussion on potential threats to the validity of LLM-based research including issues such as closed-source models, possible data leakage between LLM training data and research evaluation, and the reproducibility of LLM-based findings. In response, this paper proposes a set of guidelines tailored for SE researchers and Language Model (LM) providers to mitigate these concerns. The implications of the guidelines are illustrated using existing good practices followed by LLM providers and a practical example for SE researchers in the context of test case generation.

June Sallou, Thomas Durieux, Annibale Panichella