Carolin Holtermann (she/her)

I’m Carolin Holtermann, a third year PhD student at the University of Hamburg, Germany.

GIMMICK: Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking

We introduce GIMMICK, an extensive multimodal benchmark designed to assess a broad spectrum of cultural knowledge across 144 countries representing six global macro-regions. GIMMICK comprises six tasks built upon three new datasets that span 728 unique cultural events or facets on which we evaluated 20 LVLMs and 11 LLMs, including five proprietary and 26 open-weight models of all sizes.

SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models

While research has attempted to identify and mitigate such biases, most efforts have been concentrated around English, lagging the rapid advancement of LLMs in multilingual settings. In this paper, we introduce a new multilingual dataset SHADES to help address this issue, designed for examining culturally-specific stereotypes that may be learned by LLMs. The dataset includes stereotypes from 20 geopolitical regions and languages, spanning multiple identity cate016 gories subject to discrimination worldwide.

Stay up to date

Get notified when I publish something new, and unsubscribe at any time.

Work

  1. Company
    University of Hamburg
    Role
    PhD Candidate
    Date
  2. Company
    Blue Yonder
    Role
    Data Science Consultant
    Date
  3. Company
    SAP
    Role
    Cloud Consultant
    Date
  4. Company
    SAP
    Role
    Cooperative Student
    Date
Download CV