Facilitating Language Technologies for Crisis Preparedness and Response

funded by the NSF (2023-)

Project Description

Language technologies are promising and could have strong impact during disaster responses. they can help to triage text messages in a disaster to determine what aid to provide. Language technologies can translate vast amounts of data related to an ongoing pandemic. Responders can use these technologies to converse with victims during disaster responses. However, advances in language technologies to date are limited. They focus on a few dozen of the more than 6500 languages spoken or signed in the world today. Current language technologies neglect millions of people. This especially impacts those who are most at risk for experiencing disasters. This project provides an infrastructure for language technology advancements for crisis response. The results will be useful for everyone, no matter the language they speak.

This project builds datasets of crisis communications using dedicated data collections and social media harvesting. These datasets will be applicable to curated crisis scenarios. They will use common language scenarios necessary to communicate with vulnerable populations. This approach helps people for whom language technologies are not typically developed. The project will bring together researchers from different disciplines. These include language technology researchers, experts in disaster relief, linguistics, and human-computer interaction. The project will target representatives from the local speech communities to take part. To coordinate this effort, the project will organize yearly workshops and shared tasks with the communities.

This is a collaborative project between George Mason University and the University of Washington.

Participants

Faculty

Antonis Anastasopoulos, GMU
Fei Xia, UW

Students

Belu Ticona, PhD CS

Publications

Acknowledgements

This project is supported by NSF grants CNS-2234895 and CNS-2346334 under the CIRC (previously CCRI) program.

References

2024

  1. NLP4PI
    epidemiology.png
    From Text to Maps: LLM-Driven Extraction and Geotagging of Epidemiological Data
    Karlyn K. Harrod*, Prabin Bhandari*, and Antonios Anastasopoulos
    In Proceedings of the Third Workshop on NLP for Positive Impact code here , Nov 2024
  2. ClimateNLP
    climate.png
    Unlearning Climate Misinformation in Large Language Models
    Michael Fore, Simranjit Singh, Chaehong Lee, Amritanshu Pandey, Antonios Anastasopoulos, and Dimitrios Stamoulis
    In Proceedings of the 1st Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2024), Aug 2024
  3. JAMIA
    Clinical risk prediction using language models: benefits and considerations
    Angeela Acharya, Sulabh Shrestha, Anyi Chen, Joseph Conte, Sanja Avramovic, Siddhartha Sikdar, Antonios Anastasopoulos, and Sanmay Das
    Journal of the American Medical Informatics Association, Feb 2024