Automatic detection of source code similarities using machine learning techniques

Authors

  • Marina Elizabeth Cardenas, Doctorando/a Universidad Tecnológica Nacional – Facultad Regional Córdoba - Argentina
  • Julio Javier Castillo Director

DOI:

https://doi.org/10.33414/ajea.4.413.2019

Keywords:

source code, similarities, reuse, machine learning, text, analysis

Abstract

This thesis proposal proposes the development of a model for detection of source code similarities in order to determine the existence of reuse practices applying techniques related to computational linguistics, such as text data mining and natural language processing. The identification of code similarities have several aims, including the study of the evolution of the source code of a project, detection of reuse practices, extraction of a code fragment for “refactoring” of the project, monitoring of defects for correction, among others.

Downloads

Download data is not yet available.

Published

2019-11-05

How to Cite

Cardenas, M. E., & Castillo, J. J. (2019). Automatic detection of source code similarities using machine learning techniques. AJEA (Proceedings of UTN Academic Conferences and Events), (4). https://doi.org/10.33414/ajea.4.413.2019