Skip to content

dkltimon/EmbeddedPoems

Repository files navigation

EmbeddedPoems

Data & Code for analyzing embedded poems in Ming & Qing novels.

The file first_round_tesing_360_poems_annotations_predictions.csv contains our annotated dataset of 360 classical Chinese poems. The file includes 3 columns with the following types of information:

  • poem: The poems extracted from novels.
  • Coding_annotated: Human-annotated categories identifying the narrative functions of the poem.
  • Coding_predicted: Label predicted by ChatGPT (GPT-3.5 free version).

The file second_round_tesing_339_poems_annotations_responses_UsinglongPrompt.csv contains our annotated dataset of 339 classical Chinese text excerpts. The file includes 14 columns with the following types of information:

  • scource: The novel from which a poem was extracted.

  • text: The excerpt from novels, each row contains a poem (marked using p_s and p_e) and its context.

  • Gold standard annotations (*_Gold): Human-annotated categories identifying the narrative perspective, content, and position of embedded poems in Ming & Qing novels.

  • Label answered by different models from three large language models:

    • ChatGPT (*_ChatGPT)
    • LlamaChinese (*_LlamaChinese)
    • Llama3 (*_Llama3)

Each model provides its predictions for:

  • Perspective (narrator, character)
  • Content (commentary, plot, character portraiture, scene)
  • Position (beginning, middle, end)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages