Navigating the Information-Polluted World.
Our information supply is contaminated. In an era where generating and publishing content is remarkably easy, we are bombarded with irrelevant, redundant, unsolicited, incorrect, and otherwise low-value information.
We, as researchers in Natural Language Understanding, developed this site to facilitate the research on computational methods to address these issues. Specifically we are interested in developing machine learning and inference methods for understanding documents along the aforementioned dimensions.
Define & Address the Research Challenges
The spread of COVID-19 also gives rise to the proliferation of information pollution. We would like to view this as a case study for us to define and address the key research questions raised by the need to navigate our way through the information-polluted space
In the context of COVID-19, we would like to provide a reasonably-sized set of relevant, trustworthy, and timely information that allows users to become quickly informed about specific topics of interest.
To reach our goal of using computational methods to select read-worthy information, we need to overcome many challenges. Here are three main objectives we would like to highlight: (1) Understand Perspectives in Documents, (2) Identify Trustworthy Sources, and (3) Reduce Information Redundancy. We hope to build upon current information recommendation systems and organization schema, and take one step closer to the three objectives above. We hope that our platform will allow critical readers to become more informed about COVID-19.
Our site and underlying machine learning models are open-source. We invite researchers in related fields to join the effort on the project, and potentially contribute to our current or future research efforts.
Documents on similar topics could present different perspectives, and so have different implications. We think it is important to present information in a way that such nuances are visible to readers.
Trustworthiness of the information sources is often a good indication of the reliability of the information itself.
To overcome the trade-off between the selection-bias versus the limited amount of information we are able to present, we seek to present information with a spectrum of diverse perspectives.