annot

Analysis of manual annotation of gendered and gender biased language in archival metadata descriptions using the brat rapid annotation tool.

Annotation Taxonomy

Gendered and Gender Biased Language
├── Person Name
│   ├── Unknown
│   ├── Non-binary
│   ├── Feminine
│   └── Masculine
├── Linguistic
│   ├── Generalization
│   ├── Gendered Pronoun
│   └── Gendered Role
└── Contextual
    ├── Empowering
    ├── Occupation
    ├── Omission
    └── Stereotype

Directory Structure

annot/
├── AnnotationInstructions.docx
├── data/  
│   ├── analysis_data/ (**hidden in GitHub repo**)
│   ├── iaa/
│   └── sample/
├── notebooks/
│   ├── aggregating_data/
│   ├── analyzing_data/
│   ├── cleaning_metadata/
│   └── preparing_data
├── .gitignore
└── README.md

AnnotationInstructions.docx: instructions given to the annotators for labeling archival metadata descriptions in brat (includes the annotation taxonomy)
data:
- data/sample: directory with a sample of the annotated data as a CSV file
- data/iaa: inter-annotator agreement scores per annotator and per label
- Note: annotated data will be uploaded to this directory after further analysis
notebooks: code written to prepare, aggregate, and analyze the annotated data, and to clean additional metadata fields associated with the annotated data (e.g., date of material, language of material)

Associated Resources

Data source: Archives Online, Centre for Research Collections, University of Edinburgh
Dataset preparation repository: annot-prep
Publications:
- Research methodology: Situated Data, Situated Systems: A Methodology to Engage with Power Relations in Natural Language Processing Research
- Annotation taxonomy and data creation: Uncertainty and Inclusivity in Gender Bias Annotation: An Annotation Taxonomy and Annotated Datasets of British English Text

License and Citation

Creative Commons Attribution 4.0 International (CC BY 4.0)

@inproceedings{havens-etal-2022-uncertainty,
    title = "Uncertainty and Inclusivity in Gender Bias Annotation: An Annotation Taxonomy and Annotated Datasets of {B}ritish {E}nglish Text",
    author = "Havens, Lucy  and
      Terras, Melissa and
      Bach, Benjamin  and
      Alex, Beatrice",
    booktitle = "Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP)",
    month = jul,
    year = "2022",
    address = "Seattle, Washington",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.gebnlp-1.4",
    pages = "30--57"
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

annot

Annotation Taxonomy

Table of Contents

Directory Structure

Contents

Associated Resources

License and Citation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
notebooks		notebooks
.gitignore		.gitignore
AnnotationInstructions.docx		AnnotationInstructions.docx
README.md		README.md

thegoose20/annot

Folders and files

Latest commit

History

Repository files navigation

annot

Annotation Taxonomy

Table of Contents

Directory Structure

Contents

Associated Resources

License and Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages