Hsinchun Chen

Hsinchun Chen

Professor, Management Information Systems
Regents Professor
Member of the Graduate Faculty
Professor, BIO5 Institute
Primary Department
Contact
(520) 621-4153

Research Interest

Dr Chen's areas of expertise include:Security informatics, security big data; smart and connected health, health analytics; data, text, web mining.Digital library, intelligent information retrieval, automatic categorization and classification, machine learning for IR, large-scale information analysis and visualization.Internet resource discovery, digital libraries, IR for large-scale scientific and business databases, customized IR, multilingual IR.Knowledge-based systems design, knowledge discovery in databases, hypertext systems, machine learning, neural networks computing, genetic algorithms, simulated annealing.Cognitive modeling, human-computer interactions, IR behaviors, human problem-solving process.

Publications

Schatz, B., Mischo, W. H., Cole, T. W., Hardin, J. B., Bishop, A. P., & Chen, H. (1996). Federating diverse collections of scientific literature. Computer, 29(5), 28-35.

Abstract:

A University of Illinois project is developing an infrastructure for indexing scientific literature so that multiple Internet sources can be searched as a single federated digital library.

Liu, X., & Chen, H. (2015). Identifying Adverse Drug Events from Patient Social Media A Case Study for Diabetes. IEEE INTELLIGENT SYSTEMS, 30(3), 44-51.
Zhu, B., & Chen, H. (2008). Communication-Garden System: Visualizing a computer-mediated communication process. Decision Support Systems, 45(4), 778-794.

Abstract:

Archives of computer-mediated communication (CMC) could be valuable organizational resources. Most CMC archive systems focus on presenting one of the three aspects of a CMC community: discussion content, participants' behavior, or social networks among participants. Very few CMC archive systems support the easy integration of these three aspects. This paper thus describes two-phase research to propose an automatic approach that facilitates users' integrated understanding of discussion content and behavior of CMC participants. We validated the approach through the development and evaluation of a prototype system, the Communication-Garden system. © 2008 Elsevier B.V. All rights reserved.

Chen, H. (2008). Sentiment and affect analysis of Dark Web forums: measuring radicalization on the internet. IEEE International Conference on Intelligence and Security Informatics, 2008, IEEE ISI 2008, 104-109.

Abstract:

Dark Web forums are heavily used by extremist and terrorist groups for communication, recruiting, ideology sharing, and radicalization. These forums often have relevance to the Iraqi insurgency or Al-Qaeda and are of interest to security and intelligence organizations. This paper presents an automated approach to sentiment and affect analysis of selected radical international Jihadist Dark Web forums. The approach incorporates a rich textual feature representation and machine learning techniques to identify and measure the sentiment polarities and affect intensities expressed in forum communications. The results of sentiment and affect analysis performed on two large-scale Dark Web forums are presented, offering insight into the communities and participants. ©2008 IEEE.

Leroy, G., & Chen, H. (2005). Genescene: An ontology-enhanced integration of linguistic and Co-occurrence based relations in biomedical Texts. Journal of the American Society for Information Science and Technology, 56(5), 457-468.

Abstract:

The increasing amount of publicly available literature and experimental data in biomedicine makes it hard for biomedical researchers to stay up-to-date. Genescene is a toolkit that will help alleviate this problem by providing an overview of published literature content. We combined a linguistic parser with Concept Space, a co-occurrence based semantic net. Both techniques extract complementary biomedical relations between noun phrases from MEDLINE abstracts. The parser extracts precise and semantically rich relations from individual abstracts. Concept Space extracts relations that hold true for the collection of abstracts. The Gene Ontology, the Human Genome Nomenclature, and the Unified Medical Language System, are also integrated in Genescene. Currently, they are used to facilitate the integration of the two relation types, and to select the more interesting and high-quality relations for presentation. A user study focusing on p53 literature is discussed. All MEDLINE abstracts discussing p53 were processed in Genescene. Two researchers evaluated the terms and relations from several abstracts of interest to them. The results show that the terms were precise (precision 93%) and relevant, as were the parser relations (precision 95%). The Concept Space relations were more precise when selected with ontological knowledge (precision 78%) than without (60%). © 2005 Wiley Periodicals, Inc.