College

College of Engineering and Polymer Science

Date of Last Revision

2023-05-04 19:48:13

Major

Computer Science

Honors Course

Senior Honors Project in Computer Science

Number of Credits

Degree Name

Bachelor of Science in Computer Science

Date of Expected Graduation

Spring 2021

Abstract

Source code comment classification is an important problem for future machine learning solutions. In particular, supervised machine learning solutions that have largely subjective data labels but are difficult to obtain the labels for. Machine learning problems are problems largely because of a lack of data. In machine learning solutions, it is better to have a large amount of mediocre data than it is to have a small amount of good data. While the mediocre data might not produce the best accuracy, it produces the best results because there is much more to learn from the problem.

In this project, data was collected from student comment code in computer science classes. This data was then sorted based on various tools in order to create automated source code classification. Various data categorization and sorting methods were explored, ultimately resulting in a process where assigned letter grade was used as a sorting label. Using python, CommentLabeler, and SortAndUnique tools were developed in order to automate the manual source code labeling process. State retention and error checking were also features that were added to streamline the process further.

The most important takeaway from this experience was that the amount of data is much more important than quality. In fact, mediocre data will provide better results with regard to machine learning because there is room for improvement and it proves machine learning as a solution.

Research Sponsor

Dr. Michael L. Collard

First Reader

Yingcai Xiao

Second Reader

Zhong-Hui Duan

Honors Faculty Advisor

Zhong-Hui Duan

Recommended Citation

Sutyak, Cole, "Source Code Comment Classification Artificial Intelligence" (2021). Williams Honors College, Honors Research Projects. 1308.
https://ideaexchange.uakron.edu/honors_research_projects/1308

Download

Included in

Artificial Intelligence and Robotics Commons, Programming Languages and Compilers Commons, Software Engineering Commons

COinS

Williams Honors College, Honors Research Projects

Source Code Comment Classification Artificial Intelligence

College

Date of Last Revision

Major

Honors Course

Number of Credits

Degree Name

Date of Expected Graduation

Abstract

Research Sponsor

First Reader

Second Reader

Honors Faculty Advisor

Recommended Citation

Included in

Browse

Search

Author Corner

Links

Williams Honors College, Honors Research Projects

Source Code Comment Classification Artificial Intelligence

Author

College

Date of Last Revision

Major

Honors Course

Number of Credits

Degree Name

Date of Expected Graduation

Abstract

Research Sponsor

First Reader

Second Reader

Honors Faculty Advisor

Recommended Citation

Included in

Share

Browse

Search

Author Corner

Links