Distortions in Judged Spatial Relations in Large Language Models: The Dawn of Natural Language Geographic Data?

arXiv preprint arXiv:2401.04218

Published On 2024/1/8

We present a benchmark for assessing the capability of Large Language Models (LLMs) to discern intercardinal directions between geographic locations and apply it to three prominent LLMs: GPT-3.5, GPT-4, and Llama-2. This benchmark specifically evaluates whether LLMs exhibit a hierarchical spatial bias similar to humans, where judgments about individual locations' spatial relationships are influenced by the perceived relationships of the larger groups that contain them. To investigate this, we formulated 14 questions focusing on well-known American cities. Seven questions were designed to challenge the LLMs with scenarios potentially influenced by the orientation of larger geographical units, such as states or countries, while the remaining seven targeted locations less susceptible to such hierarchical categorization. Among the tested models, GPT-4 exhibited superior performance with 55.3% accuracy, followed by GPT-3.5 at 47.3%, and Llama-2 at 44.7%. The models showed significantly reduced accuracy on tasks with suspected hierarchical bias. For example, GPT-4's accuracy dropped to 32.9% on these tasks, compared to 85.7% on others. Despite these inaccuracies, the models identified the nearest cardinal direction in most cases, suggesting associative learning, embodying human-like misconceptions. We discuss the potential of text-based data representing geographic relationships directly to improve the spatial reasoning capabilities of LLMs.

Journal

arXiv preprint arXiv:2401.04218

Published On

2024/1/8

Authors

Alexander Zipf

Ruprecht-Karls-Universität Heidelberg

Position

Chair of GIScience HeiGIT Heidelberg Institute for Geoinformation Technology

H-Index(all)

H-Index(since 2020)

I-10 Index(all)

I-10 Index(since 2020)

Citation(all)

Citation(since 2020)

Cited By

Research Interests

Geoinformatics

GIScience

VGI

Geomatics

Geographic Information Science

University Profile Page

Ruprecht-Karls-Universität Heidelberg

Access Email

Abdulkadir Memduhoğlu

Harran Üniversitesi

Position

Geomatic Engineering

H-Index(all)

H-Index(since 2020)

I-10 Index(all)

I-10 Index(since 2020)

Citation(all)

Citation(since 2020)

Cited By

Research Interests

Geospatial Semantic Web

Cartography

GIScience

University Profile Page

Harran Üniversitesi

Access Email

Nir Fulman

Tel Aviv University

Position

PhD candidate

H-Index(all)

H-Index(since 2020)

I-10 Index(all)

I-10 Index(since 2020)

Citation(all)

Citation(since 2020)

Cited By

Research Interests

Spatial modeling

Transportation

GIS

University Profile Page

Tel Aviv University

Access Email

Other Articles from authors

Alexander Zipf

Ruprecht-Karls-Universität Heidelberg

Geo-spatial Information Science

An investigation of the temporality of OpenStreetMap data contribution activities

OpenStreetMap (OSM) is a dataset in constant change and this dynamic needs to be better understood. Based on 12-year time series of seven OSM data contribution activities extracted from 20 large cities worldwide, we investigate the temporal dynamic of OSM data production, more specifically, the auto- and cross-correlation, temporal trend, and annual seasonality of these activities. Furthermore, we evaluate and compare nine different temporal regression methods for forecasting such activities in horizons of 1–4 weeks. Several insights could be obtained from our analyses, including that the contribution activities tend to grown linearly in a moderate intra-annual cycle. Also, the performance of the temporal forecasting methods shows that they yield in general more accurate estimations of future contribution activities than a baseline metric, i.e. the arithmetic average of recent previous observations. In particular, the …

2024/3/3

Distortions in Judged Spatial Relations in Large Language Models: The Dawn of Natural Language Geographic Data?

Authors

Alexander Zipf

Ruprecht-Karls-Universität Heidelberg

Abdulkadir Memduhoğlu

Harran Üniversitesi

Nir Fulman

Tel Aviv University

Other Articles from authors

An investigation of the temporality of OpenStreetMap data contribution activities

A project-based view of urban dynamics: Analyzing ‘leapfrogging’ and fringe development in Israel

Semantic enrichment of building functions through geospatial data integration and ontological inference

How to assess the needs of vulnerable population groups towards heat-sensitive routing? An evidence-based and practical approach to reducing urban heat stress

Residential Greenness and Long-term Mortality Among Patients Who Underwent Coronary Artery Bypass Graft Surgery

OpenStreetMap Data for Automated Labelling Machine Learning Examples: The Challenge of Road Type Imbalance

Carbon fluxes related to land use and land cover change in Baden-Württemberg

Evaluating the ground point classification performance of Agisoft Metashape Software

Urban Heat Island Intensity Prediction in the Context of Heat Waves: An Evaluation of Model Performance

Investigating occasional travel patterns based on smartcard transactions

Exploring road and points of interest (POIs) associations in OpenStreetMap, a new paradigm for OSM road class prediction

Semi-supervised water tank detection to support vector control of emerging infectious diseases transmitted by Aedes Aegypti

Determination of suitable areas for wind power plant installation in Şanlıurfa with GIS and AHP

A spatio-temporal analysis investigating completeness and inequalities of global urban building data in OpenStreetMap

Exploring Non-Routine Trips Through Smartcard Transaction Analysis

Challenges and solution approach for greenhouse gas emission inventories at fine spatial resolutions–the example of the Rhine-Neckar district

Initial response to the COVID-19 pandemic on real-life well-being, social contact and roaming behavior in patients with schizophrenia, major depression and healthy controls: A …

Other articles from arXiv preprint arXiv:2401.04218 journal

Distortions in Judged Spatial Relations in Large Language Models: The Dawn of Natural Language Geographic Data?

Distortions in Judged Spatial Relations in Large Language Models: The Dawn of Natural Language Geographic Data?

Distortions in Judged Spatial Relations in Large Language Models: The Dawn of Natural Language Geographic Data?