GPTGeoChat🌎: Benchmarking Conversational Geolocation

Repository for "Granular Privacy Control for Geolocation with Vision Language Models"

Main Datasets 🗎

Downloads and Directory Structure ⬇️

The full human-annotated GPTGeoChat dataset and the AI-generated GPTGeoChatSynthetic dataset are available for download at the following links:

The directory structure of GPTGeoChat:

human
    │
    └───test
    │   │   
    │   └───annotations
    │   │       ...
    │   └───images
    │           ...
    │
    └───train
    │   │   
    │   └───annotations
    │   │       ...
    │   └───images
    │           ...
    │
    └───val
        │   
        └───annotations
        │       ...
        └───images
                ...

GPTGeoChatSynthetic, in contrast, does not include a train/test/val split:

synthetic
    │
    └───annotations
    │       ...
    └───images
            ...

These datasets include both images and the associated conversations. Images are in files named images/{id}.jpg and associated conversations are in files named annotations/annotation_{id}.json.
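
For orientation, the following is a minimal sketch of iterating over instances by pairing each annotation file with its image. The root path is an assumption about where the data is unzipped, and load_dialogue is an illustrative helper, not part of the repository.

import json
from pathlib import Path

# Assumed local layout after unzipping (see the directory structure above).
ROOT = Path("gptgeochat/human/test")  # hypothetical root; adjust to your split

def load_dialogue(annotation_path: Path) -> dict:
    """Load one annotated conversation JSON file."""
    with annotation_path.open() as f:
        return json.load(f)

# Walk annotation_{id}.json files and resolve the matching images/{id}.jpg.
for ann_path in sorted((ROOT / "annotations").glob("annotation_*.json")):
    instance_id = ann_path.stem.removeprefix("annotation_")
    image_path = ROOT / "images" / f"{instance_id}.jpg"
    dialogue = load_dialogue(ann_path)
    print(instance_id, image_path.exists(), len(dialogue["messages"]))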

Annotated Dialogue Structure 👨‍💻↔️🤖

Annotated dialogues in GPTGeoChat and GPTGeoChatSynthetic are structured as follows:

{
    "image_path": "../images/{id}.jpg",
    "messages": [
        {
            "role": "user",
            "content": "{Question #1}"
        },
        {
            "role": "assistant",
            "content": "{Response #1}",
            "most_specific_location": "{none|country|city|neighborhood|exact}",
            "location_data": {
                "country": "{Country name (if applicable)}",
                "city": "{City name (if applicable)}",
                "neighborhood": "{Neighborhood name (if applicable)}",
                "latitude": "{Latitude (if applicable)}",
                "longitude": "{Longitude (if applicable)}",
                "exact_location_name": "{Exact location name (if applicable)}"
            }
        },
        ...
    ]
}

The location annotations reflect the location information revealed in the current response or in any previous response of the dialogue.

most_specific_location is set to none when no location information has been revealed, and to exact when either the exact_location_name or both the latitude and longitude have been revealed.
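
To make the schema concrete, here is a minimal sketch that collects the most_specific_location label for each assistant turn of one dialogue. The granularity ordering and helper name are ours, inferred from the field values above; the example file name is hypothetical.

import json

# Granularity levels from least to most specific, per the schema above.
GRANULARITY_ORDER = ["none", "country", "city", "neighborhood", "exact"]

def revealed_granularities(annotation_file: str) -> list[str]:
    """Return most_specific_location for each assistant turn, in dialogue order."""
    with open(annotation_file) as f:
        dialogue = json.load(f)
    return [
        message["most_specific_location"]
        for message in dialogue["messages"]
        if message["role"] == "assistant"
    ]

# Example (hypothetical path):
# print(revealed_granularities("gptgeochat/human/test/annotations/annotation_123.json"))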

Moderation Experiments 🧑‍🔬

Processed Data Files 🗂️

We provide moderation decisions for all baseline, finetuned, and prompted agents in moderation_decisions_baselines, moderation_decisions_finetuned, and moderation_decisions_prompted, respectively. The important keys are:

  • question_id: test-set instance identifier of the form {id}_{turn_no}
  • predicted: the agent's prediction of whether or not to moderate the response (Yes|No)
  • rationale: the reason given for the moderation decision (prompted agents only)
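
For reference, a minimal sketch of loading one of these JSONL files into a question_id → prediction map. The example file name follows the naming convention described under "Benchmark Your Agents" below and is hypothetical.

import json

def load_moderation_decisions(jsonl_path: str) -> dict[str, str]:
    """Map question_id ({id}_{turn_no}) to the agent's Yes/No prediction."""
    decisions = {}
    with open(jsonl_path) as f:
        for line in f:
            record = json.loads(line)
            decisions[record["question_id"]] = record["predicted"]
    return decisions

# Hypothetical file name following the repository's naming convention.
decisions = load_moderation_decisions(
    "moderation_decisions_prompted/GPT4V_granularity=city.jsonl"
)
print(len(decisions), "decisions loaded")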

Running Experiments 🧪

Follow these steps to reproduce the experimental results from the paper:

  1. Clone the repository:
git clone https://github.com/ethanm88/GPTGeoChat.git && cd GPTGeoChat
  2. Download GPTGeoChat into the GPTGeoChat directory.

  3. Unzip GPTGeoChat:

mkdir gptgeochat && unzip human.zip && rm human.zip
  4. Generate the ground truth files. This will create two directories, moderation_decisions_ground_truth and gptgeochat/human/ground_truth_results, which aggregate the ground truth results differently for efficient computation:
python generate_ground_truths.py
  5. Run the experiments:
python generate_eval_metrics.py [--basic_metrics] [--privacy_utility] [--geocoding_distance] [--all] [--recompute_geocoding_results] [--agents]

Experiment Options:

  • --all: run all three experiments
  • --basic_metrics: calculate the precision, recall, F1 scores, and F1 score standard errors for the binary moderation task. This data was used to generate Figure 3. A minimal metric computation sketch follows this list.
  • --privacy_utility: calculate the leaked-location-proportion and wrongly-withheld-location-proportion, which together measure the privacy-utility tradeoff. This data was used to generate Figure 4.
  • --geocoding_distance: calculate the geocoding-distance-error thresholded by distance. This data was used to generate Figure 5.
    Important: this calculation uses previously computed distances from the Geoapify reverse geocoding API. These files are saved under api_distance_responses.
  • --recompute_geocoding_results: use this flag to recompute the geocoding API results. In this case you will need to generate a Geoapify API key and set the environment variable:
export GEOAPIFY_API_KEY={your_api_key}
  • --agents: specify the agents to evaluate as a space-separated list, e.g. --agents GPT4V synthetic_num_examples=1000
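
The sketch below illustrates the kind of computation --basic_metrics performs for the binary moderation task, assuming predictions and ground truth are both keyed by question_id and that "Yes" (moderate) is the positive class. It is illustrative only, not the repository's implementation.

def binary_moderation_metrics(predictions: dict[str, str],
                              ground_truth: dict[str, str]) -> dict[str, float]:
    """Precision, recall, and F1 treating "Yes" (moderate) as the positive class."""
    tp = fp = fn = 0
    for qid, gold in ground_truth.items():
        pred = predictions.get(qid, "No")  # missing predictions counted as "No"
        if pred == "Yes" and gold == "Yes":
            tp += 1
        elif pred == "Yes" and gold == "No":
            fp += 1
        elif pred == "No" and gold == "Yes":
            fn += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}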

Benchmark Your Agents 🚀

Benchmarking custom agents is easy! Just add files containing your agent's results on the GPTGeoChat test set to moderation_decisions_baselines, moderation_decisions_finetuned, or moderation_decisions_prompted, depending on the type of agent. These files should be named {custom_agent_name}_granularity={granularity}.jsonl. Running generate_eval_metrics.py with the correct arguments will then evaluate your agents. Note that you will have to generate and save a Geoapify API key to evaluate the geocoding-distance-error, as discussed previously.
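
Here is a minimal sketch of writing a results file in the expected format. The agent name, granularity, and example record below are hypothetical, and the rationale field is only expected for prompted agents.

import json

def write_agent_decisions(decisions: list[dict], agent_name: str, granularity: str,
                          out_dir: str = "moderation_decisions_prompted") -> str:
    """Write per-question moderation decisions as one JSON object per line."""
    path = f"{out_dir}/{agent_name}_granularity={granularity}.jsonl"
    with open(path, "w") as f:
        for record in decisions:
            f.write(json.dumps(record) + "\n")
    return path

# Each record needs question_id ({id}_{turn_no}) and predicted (Yes|No);
# the rationale key is only expected for prompted agents. Values are hypothetical.
example = [{"question_id": "123_1", "predicted": "Yes", "rationale": "Reveals the city."}]
write_agent_decisions(example, agent_name="my_agent", granularity="city")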

Citation ✍️

@misc{mendes2024granularprivacycontrol,
      title={Granular Privacy Control for Geolocation with Vision Language Models}, 
      author={Ethan Mendes and Yang Chen and James Hays and Sauvik Das and Wei Xu and Alan Ritter},
      year={2024},
      eprint={2407.04952},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.04952}, 
}
