College

College of Engineering and Polymer Science

Date of Last Revision

2026-05-14 07:07:24

Major

Computer Science

Honors Course

498-005

Number of Credits

3

Degree Name

Bachelor of Science

Date of Expected Graduation

Spring 2026

Abstract

This project uses digital text mining tools (OCR, NLP, sentiment analysis, and topic modeling) to analyze 19th–20th-century newspaper archives, focusing on how marginalized groups (women, immigrants, or labor workers) were historically portrayed. Many historical newspapers were dominated by elite voices, so this project aims to recover silenced or misrepresented perspectives by identifying hidden patterns in language, frequency of coverage, sentiment, and shifts in public perception over time. Using machine learning and visualization tools, the project will create interactive maps and timelines showing how representation evolved across regions.

Research Sponsor

John Hoag

First Reader

En Cheng

Second Reader

Yingcai Xiao

Honors Faculty Advisor

Zhong-Hui Duan

Proprietary and/or Confidential Information

No

Community Engaged Scholarship

No

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.