From Spatial Language to Spatial Data - a simulation-based approach

funded by the NSF (2021-)

Project Description

User experience related to spatial information is currently directly linked to the representation of the data; that is, geographic co-ordinates pinpoint locations on maps and routing algorithms determine the best route based on distance and time. In contrast, human interaction with the world is based on experience, learning and reasoning on qualitative factors, including spatial concepts, such as near/far, behind, next to, inside. Considering how spatial information is conveyed in natural language, there is no unique mapping between the spatial expressiveness and quantifiable spatial concepts. For the most part this is attributed to the highly contextualized nature of human language; that is, what human language is interpreted in part by who and where it was said and what other words surrounded the comment. The challenge in this project is on devising means to better understand people’s perception of space by deciphering such spatial language terms. This will lead to novel text and audio-based interfaces for the consumption of geospatial data such as when asking for or giving directions in a way that is intuitive to people or for systems that more effectively assist the visually impaired.

The technical aims of the project are divided into two thrusts. The first thrust develops a simulation to crowdsource geospatial language expression data by having users interact in a virtual environment. The spatial language expressions and interactions are captured using quantitative models. The second thrust then uses these user-generated descriptions to evaluate several modeling approaches that include (i) studying specific urban settings to identify contextual factors and (ii) exploring neural approaches to modeling the problem of grounding language to this spatial context. The resulting models can then be used to automatically translate language to geospatial information and, in the reverse direction, to train dialogue agents that can generate enriched, contextualized route and scene descriptions with natural, useful geospatial language expressions.

Participants

Faculty

PI Co-PI
Dieter Pfoser, Geography and Geoinformation Science Antonis Anastasopoulos, Computer Science

Students

Prabin Bhandari, PhD CS Kourosh T. Baghaei, PhD CS

Publications

Acknowledgements

This project is supported by NSF grant III-2127901 under the III Core program.

References

2024

  1. SIGSPATIAL
    urban.png
    Urban Mobility Assessment Using LLMs
    Prabin Bhandari, Antonios Anastasopoulos, and Dieter Pfoser
    In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, Atlanta, GA, USA, Nov 2024
  2. SIGSPATIAL
    trajectory.png
    Trajectory Anomaly Detection with Language Models
    Jonathan Kabala Mbuya, Dieter Pfoser, and Antonios Anastasopoulos
    In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, Atlanta, GA, USA, Nov 2024

2023

  1. SIGSPATIAL
    knowledgeable.png
    Are Large Language Models Geospatially Knowledgeable?
    Prabin Bhandari, Antonios Anastasopoulos, and Dieter Pfoser
    In Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, Hamburg, Germany, Oct 2023
  2. MRL
    geobiases.png
    Geographic and Geopolitical Biases of Language Models
    Fahim Faisal, and Antonios Anastasopoulos
    In Proceedings of the 3rd Workshop on Multi-lingual Representation Learning (MRL) Code here , Dec 2023
  3. INLG
    Trustworthiness of Children Stories Generated by Large Language Models
    Prabin Bhandari, and Hannah Brennan
    In Proceedings of the 16th International Natural Language Generation Conference Code here , Sep 2023