forked from cjbarrie/CTA-ED
-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy path10-week10.Rmd
24 lines (16 loc) · 1.12 KB
/
10-week10.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
# Week 10: Validation
This week we'll be thinking about how to validate techniques we've used in the preceding weeks. Validation is a necessary and important part of any text analysis technique.
Often we speak of validation in the context of machine labelling of large text data. But validation need not---and should not---be restricted to automated classification tasks. The articles by @ying_topics_2021 and @rodriguez_models_2021 describe ways to approach validation in unsupervised contexts. Finally, the article by @peterson_classification_2018 shows how validation and accuracy might provide a measure of substantive significance.
**Required reading**:
- @ying_topics_2021
- @rodriguez_models_2021
- @peterson_classification_2018
- @manning_introduction_2007 [ch.2: <https://nlp.stanford.edu/IR-book/information-retrieval-book.html>]
**Further reading**:
- @krippendorff_reliability_2004
- @denny_text_2018
- @grimmer_text_2013-1
- @barbera_automated_2021
- @schiller_stance_2021
**Slides**:
- Week 10 [Slides](https://docs.google.com/presentation/d/1Ib3_7MxS1WKs9Th3hiQ4HmvOsrCBpgAf3EcCWDC5GDc/edit?usp=sharing)