diff --git a/_analyzers/language-analyzers.md b/_analyzers/language-analyzers.md deleted file mode 100644 index ca4ba320dd..0000000000 --- a/_analyzers/language-analyzers.md +++ /dev/null @@ -1,44 +0,0 @@ ---- -layout: default -title: Language analyzers -nav_order: 100 -parent: Analyzers -redirect_from: - - /query-dsl/analyzers/language-analyzers/ ---- - -# Language analyzers - -OpenSearch supports the following language analyzers: -`arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `czech`, `danish`, `dutch`, `english`, `estonian`, `finnish`, `french`, `galician`, `german`, `greek`, `hindi`, `hungarian`, `indonesian`, `irish`, `italian`, `latvian`, `lithuanian`, `norwegian`, `persian`, `portuguese`, `romanian`, `russian`, `sorani`, `spanish`, `swedish`, `turkish`, and `thai`. - -To use the analyzer when you map an index, specify the value within your query. For example, to map your index with the French language analyzer, specify the `french` value for the analyzer field: - -```json - "analyzer": "french" -``` - -#### Example request - -The following query specifies the `french` language analyzer for the index `my-index`: - -```json -PUT my-index -{ - "mappings": { - "properties": { - "text": { - "type": "text", - "fields": { - "french": { - "type": "text", - "analyzer": "french" - } - } - } - } - } -} -``` - - diff --git a/_analyzers/language-analyzers/arabic.md b/_analyzers/language-analyzers/arabic.md new file mode 100644 index 0000000000..e61c684cbb --- /dev/null +++ b/_analyzers/language-analyzers/arabic.md @@ -0,0 +1,182 @@ +--- +layout: default +title: Arabic +parent: Language analyzers +grand_parent: Analyzers +nav_order: 10 +--- + +# Arabic analyzer + +The built-in `arabic` analyzer can be applied to a text field using the following command: + +```json +PUT /arabic-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "arabic" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_arabic +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_arabic_analyzer":{ + "type":"arabic", + "stem_exclusion":["تكنولوجيا","سلطة "] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Arabic analyzer internals + +The `arabic` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - decimal_digit + - stop (Arabic) + - normalization (Arabic) + - keyword + - stemmer (Arabic) + +## Custom Arabic analyzer + +You can create a custom Arabic analyzer using the following command: + +```json +PUT /arabic-index +{ + "settings": { + "analysis": { + "filter": { + "arabic_stop": { + "type": "stop", + "stopwords": "_arabic_" + }, + "arabic_stemmer": { + "type": "stemmer", + "language": "arabic" + }, + "arabic_normalization": { + "type": "arabic_normalization" + }, + "decimal_digit": { + "type": "decimal_digit" + }, + "arabic_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "arabic_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "arabic_normalization", + "decimal_digit", + "arabic_stop", + "arabic_keywords", + "arabic_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "arabic_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request 
to examine the tokens generated using the analyzer:
+
+```json
+POST /arabic-index/_analyze
+{
+  "field": "content",
+  "text": "الطلاب يدرسون في الجامعات العربية. أرقامهم ١٢٣٤٥٦."
+}
+```
+{% include copy-curl.html %}
+
+The response contains the generated tokens:
+
+```json
+{
+  "tokens": [
+    {
+      "token": "طلاب",
+      "start_offset": 0,
+      "end_offset": 6,
+      "type": "",
+      "position": 0
+    },
+    {
+      "token": "يدرس",
+      "start_offset": 7,
+      "end_offset": 13,
+      "type": "",
+      "position": 1
+    },
+    {
+      "token": "جامع",
+      "start_offset": 17,
+      "end_offset": 25,
+      "type": "",
+      "position": 3
+    },
+    {
+      "token": "عرب",
+      "start_offset": 26,
+      "end_offset": 33,
+      "type": "",
+      "position": 4
+    },
+    {
+      "token": "ارقامهم",
+      "start_offset": 35,
+      "end_offset": 42,
+      "type": "",
+      "position": 5
+    },
+    {
+      "token": "123456",
+      "start_offset": 43,
+      "end_offset": 49,
+      "type": "",
+      "position": 6
+    }
+  ]
+}
+```
\ No newline at end of file
diff --git a/_analyzers/language-analyzers/armenian.md b/_analyzers/language-analyzers/armenian.md
new file mode 100644
index 0000000000..9bd0549c80
--- /dev/null
+++ b/_analyzers/language-analyzers/armenian.md
@@ -0,0 +1,137 @@
+---
+layout: default
+title: Armenian
+parent: Language analyzers
+grand_parent: Analyzers
+nav_order: 20
+---
+
+# Armenian analyzer
+
+The built-in `armenian` analyzer can be applied to a text field using the following command:
+
+```json
+PUT /armenian-index
+{
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "armenian"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Stem exclusion
+
+You can use `stem_exclusion` with this language analyzer using the following command:
+
+```json
+PUT index_with_stem_exclusion_armenian_analyzer
+{
+  "settings": {
+    "analysis": {
+      "analyzer": {
+        "stem_exclusion_armenian_analyzer": {
+          "type": "armenian",
+          "stem_exclusion": ["բարև", "խաղաղություն"]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Armenian analyzer internals
+
+The `armenian` analyzer is built using the following components:
+
+- Tokenizer: `standard`
+
+- Token filters:
+  - lowercase
+  - stop (Armenian)
+  - keyword
+  - stemmer (Armenian)
+
+## Custom Armenian analyzer
+
+You can create a custom Armenian analyzer using the following command:
+
+```json
+PUT /armenian-index
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "armenian_stop": {
+          "type": "stop",
+          "stopwords": "_armenian_"
+        },
+        "armenian_stemmer": {
+          "type": "stemmer",
+          "language": "armenian"
+        },
+        "armenian_keywords": {
+          "type": "keyword_marker",
+          "keywords": []
+        }
+      },
+      "analyzer": {
+        "armenian_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "filter": [
+            "lowercase",
+            "armenian_stop",
+            "armenian_keywords",
+            "armenian_stemmer"
+          ]
+        }
+      }
+    }
+  },
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "armenian_analyzer"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Generated tokens
+
+Use the following request to examine the tokens generated using the analyzer:
+
+```json
+GET index_with_stem_exclusion_armenian_analyzer/_analyze
+{
+  "analyzer": "stem_exclusion_armenian_analyzer",
+  "text": "բարև բոլորին, մենք խաղաղություն ենք ուզում և նոր օր ենք սկսել"
+}
+```
+{% include copy-curl.html %}
+
+The response contains the generated tokens:
+
+```json
+{
+  "tokens": [
+    {"token": "բարև","start_offset": 0,"end_offset": 4,"type": "","position": 0},
+    {"token": "բոլոր","start_offset": 5,"end_offset": 12,"type": "","position": 1},
+    {"token": "խաղաղություն","start_offset": 19,"end_offset": 31,"type": 
"","position": 3}, + {"token": "ուզ","start_offset": 36,"end_offset": 42,"type": "","position": 5}, + {"token": "նոր","start_offset": 45,"end_offset": 48,"type": "","position": 7}, + {"token": "օր","start_offset": 49,"end_offset": 51,"type": "","position": 8}, + {"token": "սկսել","start_offset": 56,"end_offset": 61,"type": "","position": 10} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/basque.md b/_analyzers/language-analyzers/basque.md new file mode 100644 index 0000000000..e73510cc66 --- /dev/null +++ b/_analyzers/language-analyzers/basque.md @@ -0,0 +1,137 @@ +--- +layout: default +title: Basque +parent: Language analyzers +grand_parent: Analyzers +nav_order: 30 +--- + +# Basque analyzer + +The built-in `basque` analyzer can be applied to a text field using the following command: + +```json +PUT /basque-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "basque" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_basque_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_basque_analyzer": { + "type": "basque", + "stem_exclusion": ["autoritate", "baldintza"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Basque analyzer internals + +The `basque` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Basque) + - keyword + - stemmer (Basque) + +## Custom Basque analyzer + +You can create a custom Basque analyzer using the following command: + +```json +PUT /basque-index +{ + "settings": { + "analysis": { + "filter": { + "basque_stop": { + "type": "stop", + "stopwords": "_basque_" + }, + "basque_stemmer": { + "type": "stemmer", + "language": "basque" + }, + "basque_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "basque_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "basque_stop", + "basque_keywords", + "basque_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "basque_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /basque-index/_analyze +{ + "field": "content", + "text": "Ikasleek euskal unibertsitateetan ikasten dute. Haien zenbakiak 123456 dira." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "ikasle","start_offset": 0,"end_offset": 8,"type": "","position": 0}, + {"token": "euskal","start_offset": 9,"end_offset": 15,"type": "","position": 1}, + {"token": "unibertsi","start_offset": 16,"end_offset": 33,"type": "","position": 2}, + {"token": "ikas","start_offset": 34,"end_offset": 41,"type": "","position": 3}, + {"token": "haien","start_offset": 48,"end_offset": 53,"type": "","position": 5}, + {"token": "zenba","start_offset": 54,"end_offset": 63,"type": "","position": 6}, + {"token": "123456","start_offset": 64,"end_offset": 70,"type": "","position": 7} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/bengali.md b/_analyzers/language-analyzers/bengali.md new file mode 100644 index 0000000000..af913a01ef --- /dev/null +++ b/_analyzers/language-analyzers/bengali.md @@ -0,0 +1,142 @@ +--- +layout: default +title: Bengali +parent: Language analyzers +grand_parent: Analyzers +nav_order: 40 +--- + +# Bengali analyzer + +The built-in `bengali` analyzer can be applied to a text field using the following command: + +```json +PUT /bengali-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "bengali" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_bengali_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_bengali_analyzer": { + "type": "bengali", + "stem_exclusion": ["কর্তৃপক্ষ", "অনুমোদন"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Bengali analyzer internals + +The `bengali` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - decimal_digit + - indic_normalization + - normalization (Bengali) + - stop (Bengali) + - keyword + - stemmer (Bengali) + +## Custom Bengali analyzer + +You can create a custom Bengali analyzer using the following command: + +```json +PUT /bengali-index +{ + "settings": { + "analysis": { + "filter": { + "bengali_stop": { + "type": "stop", + "stopwords": "_bengali_" + }, + "bengali_stemmer": { + "type": "stemmer", + "language": "bengali" + }, + "bengali_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "bengali_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "decimal_digit", + "indic_normalization", + "bengali_normalization", + "bengali_stop", + "bengali_keywords", + "bengali_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "bengali_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /bengali-index/_analyze +{ + "field": "content", + "text": "ছাত্ররা বিশ্ববিদ্যালয়ে পড়াশোনা করে। তাদের নম্বরগুলি ১২৩৪৫৬।" +} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "ছাত্র","start_offset": 0,"end_offset": 7,"type": "","position": 0}, + {"token": "বিসসবিদালয়","start_offset": 8,"end_offset": 23,"type": "","position": 1}, + {"token": "পরাসোন","start_offset": 24,"end_offset": 32,"type": "","position": 2}, + {"token": "তা","start_offset": 38,"end_offset": 43,"type": "","position": 4}, + {"token": 
"নমমর","start_offset": 44,"end_offset": 53,"type": "","position": 5}, + {"token": "123456","start_offset": 54,"end_offset": 60,"type": "","position": 6} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/brazilian.md b/_analyzers/language-analyzers/brazilian.md new file mode 100644 index 0000000000..67db2b92bc --- /dev/null +++ b/_analyzers/language-analyzers/brazilian.md @@ -0,0 +1,137 @@ +--- +layout: default +title: Brazilian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 50 +--- + +# Brazilian analyzer + +The built-in `brazilian` analyzer can be applied to a text field using the following command: + +```json +PUT /brazilian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "brazilian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_brazilian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_brazilian_analyzer": { + "type": "brazilian", + "stem_exclusion": ["autoridade", "aprovação"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Brazilian analyzer internals + +The `brazilian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Brazilian) + - keyword + - stemmer (Brazilian) + +## Custom Brazilian analyzer + +You can create a custom Brazilian analyzer using the following command: + +```json +PUT /brazilian-index +{ + "settings": { + "analysis": { + "filter": { + "brazilian_stop": { + "type": "stop", + "stopwords": "_brazilian_" + }, + "brazilian_stemmer": { + "type": "stemmer", + "language": "brazilian" + }, + "brazilian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "brazilian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "brazilian_stop", + "brazilian_keywords", + "brazilian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "brazilian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /brazilian-index/_analyze +{ + "field": "content", + "text": "Estudantes estudam em universidades brasileiras. Seus números são 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "estudant","start_offset": 0,"end_offset": 10,"type": "","position": 0}, + {"token": "estud","start_offset": 11,"end_offset": 18,"type": "","position": 1}, + {"token": "univers","start_offset": 22,"end_offset": 35,"type": "","position": 3}, + {"token": "brasileir","start_offset": 36,"end_offset": 47,"type": "","position": 4}, + {"token": "numer","start_offset": 54,"end_offset": 61,"type": "","position": 6}, + {"token": "sao","start_offset": 62,"end_offset": 65,"type": "","position": 7}, + {"token": "123456","start_offset": 66,"end_offset": 72,"type": "","position": 8} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/bulgarian.md b/_analyzers/language-analyzers/bulgarian.md new file mode 100644 index 0000000000..42d5794e18 --- /dev/null +++ b/_analyzers/language-analyzers/bulgarian.md @@ -0,0 +1,137 @@ +--- +layout: default +title: Bulgarian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 60 +--- + +# Bulgarian analyzer + +The built-in `bulgarian` analyzer can be applied to a text field using the following command: + +```json +PUT /bulgarian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "bulgarian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_bulgarian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_bulgarian_analyzer": { + "type": "bulgarian", + "stem_exclusion": ["авторитет", "одобрение"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Bulgarian analyzer internals + +The `bulgarian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Bulgarian) + - keyword + - stemmer (Bulgarian) + +## Custom Bulgarian analyzer + +You can create a custom Bulgarian analyzer using the following command: + +```json +PUT /bulgarian-index +{ + "settings": { + "analysis": { + "filter": { + "bulgarian_stop": { + "type": "stop", + "stopwords": "_bulgarian_" + }, + "bulgarian_stemmer": { + "type": "stemmer", + "language": "bulgarian" + }, + "bulgarian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "bulgarian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "bulgarian_stop", + "bulgarian_keywords", + "bulgarian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "bulgarian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /bulgarian-index/_analyze +{ + "field": "content", + "text": "Студентите учат в българските университети. Техните номера са 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "студент","start_offset": 0,"end_offset": 10,"type": "","position": 0}, + {"token": "учат","start_offset": 11,"end_offset": 15,"type": "","position": 1}, + {"token": "българск","start_offset": 18,"end_offset": 29,"type": "","position": 3}, + {"token": "университят","start_offset": 30,"end_offset": 42,"type": "","position": 4}, + {"token": "техн","start_offset": 44,"end_offset": 51,"type": "","position": 5}, + {"token": "номер","start_offset": 52,"end_offset": 58,"type": "","position": 6}, + {"token": "123456","start_offset": 62,"end_offset": 68,"type": "","position": 8} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/catalan.md b/_analyzers/language-analyzers/catalan.md new file mode 100644 index 0000000000..89762da094 --- /dev/null +++ b/_analyzers/language-analyzers/catalan.md @@ -0,0 +1,143 @@ +--- +layout: default +title: Catalan +parent: Language analyzers +grand_parent: Analyzers +nav_order: 70 +--- + +# Catalan analyzer + +The built-in `catalan` analyzer can be applied to a text field using the following command: + +```json +PUT /catalan-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "catalan" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_catalan_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_catalan_analyzer": { + "type": "catalan", + "stem_exclusion": ["autoritat", "aprovació"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Catalan analyzer internals + +The `catalan` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - elision (Catalan) + - lowercase + - stop (Catalan) + - keyword + - stemmer (Catalan) + +## Custom Catalan analyzer + +You can create a custom Catalan analyzer using the following command: + +```json +PUT /catalan-index +{ + "settings": { + "analysis": { + "filter": { + "catalan_stop": { + "type": "stop", + "stopwords": "_catalan_" + }, + "catalan_elision": { + "type": "elision", + "articles": [ "d", "l", "m", "n", "s", "t"], + "articles_case": true + }, + "catalan_stemmer": { + "type": "stemmer", + "language": "catalan" + }, + "catalan_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "catalan_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "catalan_elision", + "lowercase", + "catalan_stop", + "catalan_keywords", + "catalan_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "catalan_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /catalan-index/_analyze +{ + "field": "content", + "text": "Els estudiants estudien a les universitats catalanes. Els seus números són 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "estud","start_offset": 4,"end_offset": 14,"type": "","position": 1}, + {"token": "estud","start_offset": 15,"end_offset": 23,"type": "","position": 2}, + {"token": "univer","start_offset": 30,"end_offset": 42,"type": "","position": 5}, + {"token": "catalan","start_offset": 43,"end_offset": 52,"type": "","position": 6}, + {"token": "numer","start_offset": 63,"end_offset": 70,"type": "","position": 9}, + {"token": "123456","start_offset": 75,"end_offset": 81,"type": "","position": 11} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/cjk.md b/_analyzers/language-analyzers/cjk.md new file mode 100644 index 0000000000..aed7e6da22 --- /dev/null +++ b/_analyzers/language-analyzers/cjk.md @@ -0,0 +1,142 @@ +--- +layout: default +title: CJK +parent: Language analyzers +grand_parent: Analyzers +nav_order: 80 +--- + +# CJK analyzer + +The built-in `cjk` analyzer can be applied to a text field using the following command: + +```json +PUT /cjk-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "cjk" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_cjk_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_cjk_analyzer": { + "type": "cjk", + "stem_exclusion": ["example", "words"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## CJK analyzer internals + +The `cjk` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - cjk_width + - lowercase + - cjk_bigram + - stop (similar to English) + +## Custom CJK analyzer + +You can create a custom CJK analyzer using the following command: + +```json +PUT /cjk-index +{ + "settings": { + "analysis": { + "filter": { + "english_stop": { + "type": "stop", + "stopwords": [ + "a", "and", "are", "as", "at", "be", "but", "by", "for", + "if", "in", "into", "is", "it", "no", "not", "of", "on", + "or", "s", "such", "t", "that", "the", "their", "then", + "there", "these", "they", "this", "to", "was", "will", + "with", "www" + ] + } + }, + "analyzer": { + "cjk_custom_analyzer": { + "tokenizer": "standard", + "filter": [ + "cjk_width", + "lowercase", + "cjk_bigram", + "english_stop" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "cjk_custom_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /cjk-index/_analyze +{ + "field": "content", + "text": "学生们在中国、日本和韩国的大学学习。123456" +} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "学生","start_offset": 0,"end_offset": 2,"type": "","position": 0}, + {"token": "生们","start_offset": 1,"end_offset": 3,"type": "","position": 1}, + {"token": "们在","start_offset": 2,"end_offset": 4,"type": "","position": 2}, + {"token": "在中","start_offset": 3,"end_offset": 5,"type": "","position": 3}, + {"token": "中国","start_offset": 4,"end_offset": 6,"type": "","position": 4}, + {"token": "日本","start_offset": 7,"end_offset": 9,"type": "","position": 5}, + {"token": "本和","start_offset": 8,"end_offset": 10,"type": "","position": 6}, + {"token": "和韩","start_offset": 9,"end_offset": 
11,"type": "","position": 7}, + {"token": "韩国","start_offset": 10,"end_offset": 12,"type": "","position": 8}, + {"token": "国的","start_offset": 11,"end_offset": 13,"type": "","position": 9}, + {"token": "的大","start_offset": 12,"end_offset": 14,"type": "","position": 10}, + {"token": "大学","start_offset": 13,"end_offset": 15,"type": "","position": 11}, + {"token": "学学","start_offset": 14,"end_offset": 16,"type": "","position": 12}, + {"token": "学习","start_offset": 15,"end_offset": 17,"type": "","position": 13}, + {"token": "123456","start_offset": 18,"end_offset": 24,"type": "","position": 14} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/czech.md b/_analyzers/language-analyzers/czech.md new file mode 100644 index 0000000000..c1778cd0f4 --- /dev/null +++ b/_analyzers/language-analyzers/czech.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Czech +parent: Language analyzers +grand_parent: Analyzers +nav_order: 90 +--- + +# Czech analyzer + +The built-in `czech` analyzer can be applied to a text field using the following command: + +```json +PUT /czech-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "czech" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_czech_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_czech_analyzer": { + "type": "czech", + "stem_exclusion": ["autorita", "schválení"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Czech analyzer internals + +The `czech` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Czech) + - keyword + - stemmer (Czech) + +## Custom Czech analyzer + +You can create a custom Czech analyzer using the following command: + +```json +PUT /czech-index +{ + "settings": { + "analysis": { + "filter": { + "czech_stop": { + "type": "stop", + "stopwords": "_czech_" + }, + "czech_stemmer": { + "type": "stemmer", + "language": "czech" + }, + "czech_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "czech_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "czech_stop", + "czech_keywords", + "czech_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "czech_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /czech-index/_analyze +{ + "field": "content", + "text": "Studenti studují na českých univerzitách. Jejich čísla jsou 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "student", + "start_offset": 0, + "end_offset": 8, + "type": "", + "position": 0 + }, + { + "token": "studuj", + "start_offset": 9, + "end_offset": 16, + "type": "", + "position": 1 + }, + { + "token": "česk", + "start_offset": 20, + "end_offset": 27, + "type": "", + "position": 3 + }, + { + "token": "univerzit", + "start_offset": 28, + "end_offset": 40, + "type": "", + "position": 4 + }, + { + "token": "čísl", + "start_offset": 49, + "end_offset": 54, + "type": "", + "position": 6 + }, + { + "token": "123456", + "start_offset": 60, + "end_offset": 66, + "type": "", + "position": 8 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/danish.md b/_analyzers/language-analyzers/danish.md new file mode 100644 index 0000000000..b5ee1b0e97 --- /dev/null +++ b/_analyzers/language-analyzers/danish.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Danish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 100 +--- + +# Danish analyzer + +The built-in `danish` analyzer can be applied to a text field using the following command: + +```json +PUT /danish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "danish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_danish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_danish_analyzer": { + "type": "danish", + "stem_exclusion": ["autoritet", "godkendelse"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Danish analyzer internals + +The `danish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Danish) + - keyword + - stemmer (Danish) + +## Custom Danish analyzer + +You can create a custom Danish analyzer using the following command: + +```json +PUT /danish-index +{ + "settings": { + "analysis": { + "filter": { + "danish_stop": { + "type": "stop", + "stopwords": "_danish_" + }, + "danish_stemmer": { + "type": "stemmer", + "language": "danish" + }, + "danish_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "danish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "danish_stop", + "danish_keywords", + "danish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "danish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /danish-index/_analyze +{ + "field": "content", + "text": "Studerende studerer på de danske universiteter. Deres numre er 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "stud", + "start_offset": 0, + "end_offset": 10, + "type": "", + "position": 0 + }, + { + "token": "stud", + "start_offset": 11, + "end_offset": 19, + "type": "", + "position": 1 + }, + { + "token": "dansk", + "start_offset": 26, + "end_offset": 32, + "type": "", + "position": 4 + }, + { + "token": "universitet", + "start_offset": 33, + "end_offset": 46, + "type": "", + "position": 5 + }, + { + "token": "numr", + "start_offset": 54, + "end_offset": 59, + "type": "", + "position": 7 + }, + { + "token": "123456", + "start_offset": 63, + "end_offset": 69, + "type": "", + "position": 9 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/dutch.md b/_analyzers/language-analyzers/dutch.md new file mode 100644 index 0000000000..0259707d78 --- /dev/null +++ b/_analyzers/language-analyzers/dutch.md @@ -0,0 +1,148 @@ +--- +layout: default +title: Dutch +parent: Language analyzers +grand_parent: Analyzers +nav_order: 110 +--- + +# Dutch analyzer + +The built-in `dutch` analyzer can be applied to a text field using the following command: + +```json +PUT /dutch-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "dutch" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_dutch_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_dutch_analyzer": { + "type": "dutch", + "stem_exclusion": ["autoriteit", "goedkeuring"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Dutch analyzer internals + +The `dutch` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Dutch) + - keyword + - stemmer_override + - stemmer (Dutch) + +## Custom Dutch analyzer + +You can create a custom Dutch analyzer using the following command: + +```json +PUT /dutch-index +{ + "settings": { + "analysis": { + "filter": { + "dutch_stop": { + "type": "stop", + "stopwords": "_dutch_" + }, + "dutch_stemmer": { + "type": "stemmer", + "language": "dutch" + }, + "dutch_keywords": { + "type": "keyword_marker", + "keywords": [] + }, + "dutch_override": { + "type": "stemmer_override", + "rules": [ + "fiets=>fiets", + "bromfiets=>bromfiets", + "ei=>eier", + "kind=>kinder" + ] + } + }, + "analyzer": { + "dutch_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "dutch_stop", + "dutch_keywords", + "dutch_override", + "dutch_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "dutch_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /dutch-index/_analyze +{ + "field": "content", + "text": "De studenten studeren in Nederland en bezoeken Amsterdam. Hun nummers zijn 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "student","start_offset": 3,"end_offset": 12,"type": "","position": 1}, + {"token": "studer","start_offset": 13,"end_offset": 21,"type": "","position": 2}, + {"token": "nederland","start_offset": 25,"end_offset": 34,"type": "","position": 4}, + {"token": "bezoek","start_offset": 38,"end_offset": 46,"type": "","position": 6}, + {"token": "amsterdam","start_offset": 47,"end_offset": 56,"type": "","position": 7}, + {"token": "nummer","start_offset": 62,"end_offset": 69,"type": "","position": 9}, + {"token": "123456","start_offset": 75,"end_offset": 81,"type": "","position": 11} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/english.md b/_analyzers/language-analyzers/english.md new file mode 100644 index 0000000000..2d0b600312 --- /dev/null +++ b/_analyzers/language-analyzers/english.md @@ -0,0 +1,143 @@ +--- +layout: default +title: English +parent: Language analyzers +grand_parent: Analyzers +nav_order: 120 +--- + +# English analyzer + +The built-in `english` analyzer can be applied to a text field using the following command: + +```json +PUT /english-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "english" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_english_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_english_analyzer": { + "type": "english", + "stem_exclusion": ["authority", "authorization"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## English analyzer internals + +The `english` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - stemmer (possessive_english) + - lowercase + - stop (English) + - keyword + - stemmer (English) + +## Custom English analyzer + +You can create a custom English analyzer using the following command: + +```json +PUT /english-index +{ + "settings": { + "analysis": { + "filter": { + "english_stop": { + "type": "stop", + "stopwords": "_english_" + }, + "english_stemmer": { + "type": "stemmer", + "language": "english" + }, + "english_keywords": { + "type": "keyword_marker", + "keywords": [] + }, + "english_possessive_stemmer": { + "type": "stemmer", + "language": "possessive_english" + } + }, + "analyzer": { + "english_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "english_possessive_stemmer", + "lowercase", + "english_stop", + "english_keywords", + "english_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "english_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /english-index/_analyze +{ + "field": "content", + "text": "The students study in the USA and work at NASA. Their numbers are 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "student","start_offset": 4,"end_offset": 12,"type": "","position": 1}, + {"token": "studi","start_offset": 13,"end_offset": 18,"type": "","position": 2}, + {"token": "usa","start_offset": 26,"end_offset": 29,"type": "","position": 5}, + {"token": "work","start_offset": 34,"end_offset": 38,"type": "","position": 7}, + {"token": "nasa","start_offset": 42,"end_offset": 46,"type": "","position": 9}, + {"token": "number","start_offset": 54,"end_offset": 61,"type": "","position": 11}, + {"token": "123456","start_offset": 66,"end_offset": 72,"type": "","position": 13} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/estonian.md b/_analyzers/language-analyzers/estonian.md new file mode 100644 index 0000000000..a4cb664f18 --- /dev/null +++ b/_analyzers/language-analyzers/estonian.md @@ -0,0 +1,139 @@ +--- +layout: default +title: Estonian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 130 +--- + +# Estonian analyzer + +The built-in `estonian` analyzer can be applied to a text field using the following command: + +```json +PUT /estonian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "estonian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_estonian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_estonian_analyzer": { + "type": "estonian", + "stem_exclusion": ["autoriteet", "kinnitus"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Estonian analyzer internals + +The `estonian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Estonian) + - keyword + - stemmer (Estonian) + +## Custom Estonian analyzer + +You can create a custom Estonian analyzer using the following command: + +```json +PUT /estonian-index +{ + "settings": { + "analysis": { + "filter": { + "estonian_stop": { + "type": "stop", + "stopwords": "_estonian_" + }, + "estonian_stemmer": { + "type": "stemmer", + "language": "estonian" + }, + "estonian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "estonian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "estonian_stop", + "estonian_keywords", + "estonian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "estonian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /estonian-index/_analyze +{ + "field": "content", + "text": "Õpilased õpivad Tallinnas ja Eesti ülikoolides. Nende numbrid on 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "õpilase","start_offset": 0,"end_offset": 8,"type": "","position": 0}, + {"token": "õpi","start_offset": 9,"end_offset": 15,"type": "","position": 1}, + {"token": "tallinna","start_offset": 16,"end_offset": 25,"type": "","position": 2}, + {"token": "eesti","start_offset": 29,"end_offset": 34,"type": "","position": 4}, + {"token": "ülikooli","start_offset": 35,"end_offset": 46,"type": "","position": 5}, + {"token": "nende","start_offset": 48,"end_offset": 53,"type": "","position": 6}, + {"token": "numbri","start_offset": 54,"end_offset": 61,"type": "","position": 7}, + {"token": "on","start_offset": 62,"end_offset": 64,"type": "","position": 8}, + {"token": "123456","start_offset": 65,"end_offset": 71,"type": "","position": 9} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/finnish.md b/_analyzers/language-analyzers/finnish.md new file mode 100644 index 0000000000..6f559650d2 --- /dev/null +++ b/_analyzers/language-analyzers/finnish.md @@ -0,0 +1,137 @@ +--- +layout: default +title: Finnish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 140 +--- + +# Finnish analyzer + +The built-in `finnish` analyzer can be applied to a text field using the following command: + +```json +PUT /finnish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "finnish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_finnish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_finnish_analyzer": { + "type": "finnish", + "stem_exclusion": ["valta", "hyväksyntä"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Finnish analyzer internals + +The `finnish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Finnish) + - keyword + - stemmer (Finnish) + +## Custom Finnish analyzer + +You can create a custom Finnish analyzer using the following command: + +```json +PUT /finnish-index +{ + "settings": { + "analysis": { + "filter": { + "finnish_stop": { + "type": "stop", + "stopwords": "_finnish_" + }, + "finnish_stemmer": { + "type": "stemmer", + "language": "finnish" + }, + "finnish_keywords": { + "type": "keyword_marker", + "keywords": ["Helsinki", "Suomi"] + } + }, + "analyzer": { + "finnish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "finnish_stop", + "finnish_keywords", + "finnish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "finnish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /finnish-index/_analyze +{ + "field": "content", + "text": "Opiskelijat opiskelevat Helsingissä ja Suomen yliopistoissa. Heidän numeronsa ovat 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "opiskelij","start_offset": 0,"end_offset": 11,"type": "","position": 0}, + {"token": "opiskelev","start_offset": 12,"end_offset": 23,"type": "","position": 1}, + {"token": "helsing","start_offset": 24,"end_offset": 35,"type": "","position": 2}, + {"token": "suome","start_offset": 39,"end_offset": 45,"type": "","position": 4}, + {"token": "yliopisto","start_offset": 46,"end_offset": 59,"type": "","position": 5}, + {"token": "numero","start_offset": 68,"end_offset": 77,"type": "","position": 7}, + {"token": "123456","start_offset": 83,"end_offset": 89,"type": "","position": 9} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/french.md b/_analyzers/language-analyzers/french.md new file mode 100644 index 0000000000..64e7ab5415 --- /dev/null +++ b/_analyzers/language-analyzers/french.md @@ -0,0 +1,148 @@ +--- +layout: default +title: French +parent: Language analyzers +grand_parent: Analyzers +nav_order: 150 +--- + +# French analyzer + +The built-in `french` analyzer can be applied to a text field using the following command: + +```json +PUT /french-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "french" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_french_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_french_analyzer": { + "type": "french", + "stem_exclusion": ["autorité", "acceptation"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## French analyzer internals + +The `french` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - elision (French) + - lowercase + - stop (French) + - keyword + - stemmer (French) + +## Custom French analyzer + +You can create a custom French analyzer using the following command: + +```json +PUT /french-index +{ + "settings": { + "analysis": { + "filter": { + "french_stop": { + "type": "stop", + "stopwords": "_french_" + }, + "french_elision": { + "type": "elision", + "articles_case": true, + "articles": [ + "l", "m", "t", "qu", "n", "s", + "j", "d", "c", "jusqu", "quoiqu", + "lorsqu", "puisqu" + ] + }, + "french_stemmer": { + "type": "stemmer", + "language": "light_french" + }, + "french_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "french_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "french_elision", + "lowercase", + "french_stop", + "french_keywords", + "french_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "french_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /french-index/_analyze +{ + "field": "content", + "text": "Les étudiants étudient à Paris et dans les universités françaises. Leurs numéros sont 123456." 
+}
+```
+{% include copy-curl.html %}
+
+The response contains the generated tokens:
+
+```json
+{
+  "tokens": [
+    {"token": "etudiant","start_offset": 4,"end_offset": 13,"type": "","position": 1},
+    {"token": "etudient","start_offset": 14,"end_offset": 22,"type": "","position": 2},
+    {"token": "pari","start_offset": 25,"end_offset": 30,"type": "","position": 4},
+    {"token": "universit","start_offset": 43,"end_offset": 54,"type": "","position": 8},
+    {"token": "francais","start_offset": 55,"end_offset": 65,"type": "","position": 9},
+    {"token": "numero","start_offset": 73,"end_offset": 80,"type": "","position": 11},
+    {"token": "123456","start_offset": 86,"end_offset": 92,"type": "","position": 13}
+  ]
+}
+```
\ No newline at end of file
diff --git a/_analyzers/language-analyzers/galician.md b/_analyzers/language-analyzers/galician.md
new file mode 100644
index 0000000000..00338b23a7
--- /dev/null
+++ b/_analyzers/language-analyzers/galician.md
@@ -0,0 +1,138 @@
+---
+layout: default
+title: Galician
+parent: Language analyzers
+grand_parent: Analyzers
+nav_order: 160
+---
+
+# Galician analyzer
+
+The built-in `galician` analyzer can be applied to a text field using the following command:
+
+```json
+PUT /galician-index
+{
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "galician"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Stem exclusion
+
+You can use `stem_exclusion` with this language analyzer using the following command:
+
+```json
+PUT index_with_stem_exclusion_galician_analyzer
+{
+  "settings": {
+    "analysis": {
+      "analyzer": {
+        "stem_exclusion_galician_analyzer": {
+          "type": "galician",
+          "stem_exclusion": ["autoridade", "aceptación"]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Galician analyzer internals
+
+The `galician` analyzer is built using the following components:
+
+- Tokenizer: `standard`
+
+- Token filters:
+  - lowercase
+  - stop (Galician)
+  - keyword
+  - stemmer (Galician)
+
+## Custom Galician analyzer
+
+You can create a custom Galician analyzer using the following command:
+
+```json
+PUT /galician-index
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "galician_stop": {
+          "type": "stop",
+          "stopwords": "_galician_"
+        },
+        "galician_stemmer": {
+          "type": "stemmer",
+          "language": "galician"
+        },
+        "galician_keywords": {
+          "type": "keyword_marker",
+          "keywords": []
+        }
+      },
+      "analyzer": {
+        "galician_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "filter": [
+            "lowercase",
+            "galician_stop",
+            "galician_keywords",
+            "galician_stemmer"
+          ]
+        }
+      }
+    }
+  },
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "galician_analyzer"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Generated tokens
+
+Use the following request to examine the tokens generated using the analyzer:
+
+```json
+POST /galician-index/_analyze
+{
+  "field": "content",
+  "text": "Os estudantes estudan en Santiago e nas universidades galegas. Os seus números son 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "estud","start_offset": 3,"end_offset": 13,"type": "","position": 1}, + {"token": "estud","start_offset": 14,"end_offset": 21,"type": "","position": 2}, + {"token": "santiag","start_offset": 25,"end_offset": 33,"type": "","position": 4}, + {"token": "univers","start_offset": 40,"end_offset": 53,"type": "","position": 7}, + {"token": "galeg","start_offset": 54,"end_offset": 61,"type": "","position": 8}, + {"token": "numer","start_offset": 71,"end_offset": 78,"type": "","position": 11}, + {"token": "son","start_offset": 79,"end_offset": 82,"type": "","position": 12}, + {"token": "123456","start_offset": 83,"end_offset": 89,"type": "","position": 13} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/german.md b/_analyzers/language-analyzers/german.md new file mode 100644 index 0000000000..4071ef5378 --- /dev/null +++ b/_analyzers/language-analyzers/german.md @@ -0,0 +1,174 @@ +--- +layout: default +title: German +parent: Language analyzers +grand_parent: Analyzers +nav_order: 170 +--- + +# German analyzer + +The built-in `german` analyzer can be applied to a text field using the following command: + +```json +PUT /german-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "german" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_german_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_german_analyzer": { + "type": "german", + "stem_exclusion": ["Autorität", "Genehmigung"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## German analyzer internals + +The `german` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (German) + - keyword + - normalization (German) + - stemmer (German) + +## Custom German analyzer + +You can create a custom German analyzer using the following command: + +```json +PUT /german-index +{ + "settings": { + "analysis": { + "filter": { + "german_stop": { + "type": "stop", + "stopwords": "_german_" + }, + "german_stemmer": { + "type": "stemmer", + "language": "light_german" + }, + "german_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "german_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "german_stop", + "german_keywords", + "german_normalization", + "german_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "german_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /german-index/_analyze +{ + "field": "content", + "text": "Die Studenten studieren an den deutschen Universitäten. Ihre Nummern sind 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "student", + "start_offset": 4, + "end_offset": 13, + "type": "", + "position": 1 + }, + { + "token": "studi", + "start_offset": 14, + "end_offset": 23, + "type": "", + "position": 2 + }, + { + "token": "deutsch", + "start_offset": 31, + "end_offset": 40, + "type": "", + "position": 5 + }, + { + "token": "universitat", + "start_offset": 41, + "end_offset": 54, + "type": "", + "position": 6 + }, + { + "token": "numm", + "start_offset": 61, + "end_offset": 68, + "type": "", + "position": 8 + }, + { + "token": "123456", + "start_offset": 74, + "end_offset": 80, + "type": "", + "position": 10 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/greek.md b/_analyzers/language-analyzers/greek.md new file mode 100644 index 0000000000..2446b1e2d6 --- /dev/null +++ b/_analyzers/language-analyzers/greek.md @@ -0,0 +1,139 @@ +--- +layout: default +title: Greek +parent: Language analyzers +grand_parent: Analyzers +nav_order: 180 +--- + +# Greek analyzer + +The built-in `greek` analyzer can be applied to a text field using the following command: + +```json +PUT /greek-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "greek" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_greek_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_greek_analyzer": { + "type": "greek", + "stem_exclusion": ["αρχή", "έγκριση"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Greek analyzer internals + +The `greek` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Greek) + - keyword + - stemmer (Greek) + +## Custom Greek analyzer + +You can create a custom Greek analyzer using the following command: + +```json +PUT /greek-index +{ + "settings": { + "analysis": { + "filter": { + "greek_stop": { + "type": "stop", + "stopwords": "_greek_" + }, + "greek_stemmer": { + "type": "stemmer", + "language": "greek" + }, + "greek_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "greek_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "greek_stop", + "greek_keywords", + "greek_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "greek_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /greek-index/_analyze +{ + "field": "content", + "text": "Οι φοιτητές σπουδάζουν στα ελληνικά πανεπιστήμια. Οι αριθμοί τους είναι 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "φοιτητές","start_offset": 3,"end_offset": 11,"type": "","position": 1}, + {"token": "σπουδάζ","start_offset": 12,"end_offset": 22,"type": "","position": 2}, + {"token": "στα","start_offset": 23,"end_offset": 26,"type": "","position": 3}, + {"token": "ελληνικά","start_offset": 27,"end_offset": 35,"type": "","position": 4}, + {"token": "πανεπιστήμ","start_offset": 36,"end_offset": 48,"type": "","position": 5}, + {"token": "αριθμοί","start_offset": 53,"end_offset": 60,"type": "","position": 7}, + {"token": "τους","start_offset": 61,"end_offset": 65,"type": "","position": 8}, + {"token": "είνα","start_offset": 66,"end_offset": 71,"type": "","position": 9}, + {"token": "123456","start_offset": 72,"end_offset": 78,"type": "","position": 10} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/hindi.md b/_analyzers/language-analyzers/hindi.md new file mode 100644 index 0000000000..93f2eea319 --- /dev/null +++ b/_analyzers/language-analyzers/hindi.md @@ -0,0 +1,178 @@ +--- +layout: default +title: Hindi +parent: Language analyzers +grand_parent: Analyzers +nav_order: 190 +--- + +# Hindi analyzer + +The built-in `hindi` analyzer can be applied to a text field using the following command: + +```json +PUT /hindi-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "hindi" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_hindi_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_hindi_analyzer": { + "type": "hindi", + "stem_exclusion": ["अधिकार", "अनुमोदन"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Hindi analyzer internals + +The `hindi` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - decimal_digit + - keyword + - normalization (indic) + - normalization (Hindi) + - stop (Hindi) + - stemmer (Hindi) + +## Custom Hindi analyzer + +You can create a custom Hindi analyzer using the following command: + +```json +PUT /hindi-index +{ + "settings": { + "analysis": { + "filter": { + "hindi_stop": { + "type": "stop", + "stopwords": "_hindi_" + }, + "hindi_stemmer": { + "type": "stemmer", + "language": "hindi" + }, + "hindi_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "hindi_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "decimal_digit", + "hindi_keywords", + "indic_normalization", + "hindi_normalization", + "hindi_stop", + "hindi_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "hindi_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /hindi-index/_analyze +{ + "field": "content", + "text": "छात्र भारतीय विश्वविद्यालयों में पढ़ते हैं। उनके नंबर १२३४५६ हैं।" +} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "छातर", + "start_offset": 0, + "end_offset": 5, + "type": "", + "position": 0 + }, + { + "token": "भारतिय", + "start_offset": 6, + "end_offset": 12, + "type": "", + "position": 1 + }, + { + "token": "विशवविदयालय", + 
"start_offset": 13, + "end_offset": 28, + "type": "", + "position": 2 + }, + { + "token": "पढ", + "start_offset": 33, + "end_offset": 38, + "type": "", + "position": 4 + }, + { + "token": "नंबर", + "start_offset": 49, + "end_offset": 53, + "type": "", + "position": 7 + }, + { + "token": "123456", + "start_offset": 54, + "end_offset": 60, + "type": "", + "position": 8 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/hungarian.md b/_analyzers/language-analyzers/hungarian.md new file mode 100644 index 0000000000..d115c5d29c --- /dev/null +++ b/_analyzers/language-analyzers/hungarian.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Hungarian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 200 +--- + +# Hungarian analyzer + +The built-in `hungarian` analyzer can be applied to a text field using the following command: + +```json +PUT /hungarian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "hungarian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_hungarian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_hungarian_analyzer": { + "type": "hungarian", + "stem_exclusion": ["hatalom", "jóváhagyás"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Hungarian analyzer internals + +The `hungarian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Hungarian) + - keyword + - stemmer (Hungarian) + +## Custom Hungarian analyzer + +You can create a custom Hungarian analyzer using the following command: + +```json +PUT /hungarian-index +{ + "settings": { + "analysis": { + "filter": { + "hungarian_stop": { + "type": "stop", + "stopwords": "_hungarian_" + }, + "hungarian_stemmer": { + "type": "stemmer", + "language": "hungarian" + }, + "hungarian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "hungarian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "hungarian_stop", + "hungarian_keywords", + "hungarian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "hungarian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /hungarian-index/_analyze +{ + "field": "content", + "text": "A diákok a magyar egyetemeken tanulnak. A számaik 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "diák", + "start_offset": 2, + "end_offset": 8, + "type": "", + "position": 1 + }, + { + "token": "magyar", + "start_offset": 11, + "end_offset": 17, + "type": "", + "position": 3 + }, + { + "token": "egyetem", + "start_offset": 18, + "end_offset": 29, + "type": "", + "position": 4 + }, + { + "token": "tanul", + "start_offset": 30, + "end_offset": 38, + "type": "", + "position": 5 + }, + { + "token": "szám", + "start_offset": 42, + "end_offset": 49, + "type": "", + "position": 7 + }, + { + "token": "123456", + "start_offset": 50, + "end_offset": 56, + "type": "", + "position": 8 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/index.md b/_analyzers/language-analyzers/index.md new file mode 100644 index 0000000000..89a4a42254 --- /dev/null +++ b/_analyzers/language-analyzers/index.md @@ -0,0 +1,135 @@ +--- +layout: default +title: Language analyzers +nav_order: 100 +parent: Analyzers +has_children: true +has_toc: true +redirect_from: + - /query-dsl/analyzers/language-analyzers/ + - /analyzers/language-analyzers/ +--- + +# Language analyzers + +OpenSearch supports the following language analyzers: +`arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `czech`, `danish`, `dutch`, `english`, `estonian`, `finnish`, `french`, `galician`, `german`, `greek`, `hindi`, `hungarian`, `indonesian`, `irish`, `italian`, `latvian`, `lithuanian`, `norwegian`, `persian`, `portuguese`, `romanian`, `russian`, `sorani`, `spanish`, `swedish`, `thai`, and `turkish`. + +To use an analyzer when you map an index, specify the value in your query. For example, to map your index with the French language analyzer, specify the `french` value in the analyzer field: + +```json + "analyzer": "french" +``` + +#### Example request + +The following query specifies an index `my-index` with the `content` field configured as multi-field, and a sub-field named `french` is configured with the `french` language analyzer: + +```json +PUT my-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "fields": { + "french": { + "type": "text", + "analyzer": "french" + } + } + } + } + } +} +``` +{% include copy-curl.html %} + +The default `french` analyzer can also be configured for the entire index using the following query: + +```json +PUT my-index +{ + "settings": { + "analysis": { + "analyzer": { + "default": { + "type": "french" + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text" + }, + "title": { + "type": "text" + }, + "description": { + "type": "text" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can apply stem exclusion to any language analyzer by providing a list of lowercase words that should be excluded from stemming. Internally, OpenSearch uses the `keyword_marker` token filter to mark these words as keywords, ensuring that they are not stemmed. + +## Stem exclusion example + +Use the following request to configure `stem_exclusion`: + +```json +PUT index_with_stem_exclusion_english_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_english_analyzer":{ + "type":"english", + "stem_exclusion": ["manager", "management"] + } + } + } + } +} +``` +{% include copy-curl.html %} + + +## Stem exclusion with custom analyzers + +All language analyzers consist of tokenizers and token filters specific to a particular language. 
If you want to implement a custom version of the language analyzer with stem exclusion, you need to configure the `keyword_marker` token filter and list the words excluded from stemming in the `keywords` parameter:
+
+```json
+PUT index_with_keyword_marker_analyzer
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "protected_keywords_filter": {
+          "type": "keyword_marker",
+          "keywords": ["Apple", "OpenSearch"]
+        }
+      },
+      "analyzer": {
+        "custom_english_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "filter": [
+            "lowercase",
+            "protected_keywords_filter",
+            "english_stemmer"
+          ]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
diff --git a/_analyzers/language-analyzers/indonesian.md b/_analyzers/language-analyzers/indonesian.md
new file mode 100644
index 0000000000..5c3d430b3a
--- /dev/null
+++ b/_analyzers/language-analyzers/indonesian.md
@@ -0,0 +1,172 @@
+---
+layout: default
+title: Indonesian
+parent: Language analyzers
+grand_parent: Analyzers
+nav_order: 210
+---
+
+# Indonesian analyzer
+
+The built-in `indonesian` analyzer can be applied to a text field using the following command:
+
+```json
+PUT /indonesian-index
+{
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "indonesian"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Stem exclusion
+
+You can use `stem_exclusion` with this language analyzer using the following command:
+
+```json
+PUT index_with_stem_exclusion_indonesian_analyzer
+{
+  "settings": {
+    "analysis": {
+      "analyzer": {
+        "stem_exclusion_indonesian_analyzer": {
+          "type": "indonesian",
+          "stem_exclusion": ["otoritas", "persetujuan"]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Indonesian analyzer internals
+
+The `indonesian` analyzer is built using the following components:
+
+- Tokenizer: `standard`
+
+- Token filters:
+  - lowercase
+  - stop (Indonesian)
+  - keyword
+  - stemmer (Indonesian)
+
+## Custom Indonesian analyzer
+
+You can create a custom Indonesian analyzer using the following command:
+
+```json
+PUT /indonesian-index
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "indonesian_stop": {
+          "type": "stop",
+          "stopwords": "_indonesian_"
+        },
+        "indonesian_stemmer": {
+          "type": "stemmer",
+          "language": "indonesian"
+        },
+        "indonesian_keywords": {
+          "type": "keyword_marker",
+          "keywords": []
+        }
+      },
+      "analyzer": {
+        "indonesian_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "filter": [
+            "lowercase",
+            "indonesian_stop",
+            "indonesian_keywords",
+            "indonesian_stemmer"
+          ]
+        }
+      }
+    }
+  },
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "indonesian_analyzer"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Generated tokens
+
+Use the following request to examine the tokens generated using the analyzer:
+
+```json
+POST /indonesian-index/_analyze
+{
+  "field": "content",
+  "text": "Mahasiswa belajar di universitas Indonesia. Nomor mereka adalah 123456."
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "mahasiswa", + "start_offset": 0, + "end_offset": 9, + "type": "", + "position": 0 + }, + { + "token": "ajar", + "start_offset": 10, + "end_offset": 17, + "type": "", + "position": 1 + }, + { + "token": "universitas", + "start_offset": 21, + "end_offset": 32, + "type": "", + "position": 3 + }, + { + "token": "indonesia", + "start_offset": 33, + "end_offset": 42, + "type": "", + "position": 4 + }, + { + "token": "nomor", + "start_offset": 44, + "end_offset": 49, + "type": "", + "position": 5 + }, + { + "token": "123456", + "start_offset": 64, + "end_offset": 70, + "type": "", + "position": 8 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/irish.md b/_analyzers/language-analyzers/irish.md new file mode 100644 index 0000000000..3e1535d134 --- /dev/null +++ b/_analyzers/language-analyzers/irish.md @@ -0,0 +1,157 @@ +--- +layout: default +title: Irish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 210 +--- + +# Irish analyzer + +The built-in `irish` analyzer can be applied to a text field using the following command: + +```json +PUT /irish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "irish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_irish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_irish_analyzer": { + "type": "irish", + "stem_exclusion": ["údarás", "faomhadh"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Irish analyzer internals + +The `irish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - hyphenation (Irish) + - elision (Irish) + - lowercase (Irish) + - stop (Irish) + - keyword + - stemmer (Irish) + +## Custom Irish analyzer + +You can create a custom Irish analyzer using the following command: + +```json +PUT /irish-index +{ + "settings": { + "analysis": { + "filter": { + "irish_stop": { + "type": "stop", + "stopwords": "_irish_" + }, + "irish_elision": { + "type": "elision", + "articles": [ "d", "m", "b" ], + "articles_case": true + }, + "irish_hyphenation": { + "type": "stop", + "stopwords": [ "h", "n", "t" ], + "ignore_case": true + }, + "irish_lowercase": { + "type": "lowercase", + "language": "irish" + }, + "irish_stemmer": { + "type": "stemmer", + "language": "irish" + }, + "irish_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "irish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "irish_hyphenation", + "irish_elision", + "irish_lowercase", + "irish_stop", + "irish_keywords", + "irish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "irish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /irish-index/_analyze +{ + "field": "content", + "text": "Tá mic léinn ag staidéar in ollscoileanna na hÉireann. Is iad a gcuid uimhreacha ná 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "tá","start_offset": 0,"end_offset": 2,"type": "","position": 0}, + {"token": "mic","start_offset": 3,"end_offset": 6,"type": "","position": 1}, + {"token": "léinn","start_offset": 7,"end_offset": 12,"type": "","position": 2}, + {"token": "staidéar","start_offset": 16,"end_offset": 24,"type": "","position": 4}, + {"token": "ollscoileanna","start_offset": 28,"end_offset": 41,"type": "","position": 6}, + {"token": "héireann","start_offset": 45,"end_offset": 53,"type": "","position": 8}, + {"token": "cuid","start_offset": 64,"end_offset": 69,"type": "","position": 12}, + {"token": "uimhreacha","start_offset": 70,"end_offset": 80,"type": "","position": 13}, + {"token": "123456","start_offset": 84,"end_offset": 90,"type": "","position": 15} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/italian.md b/_analyzers/language-analyzers/italian.md new file mode 100644 index 0000000000..190056d63c --- /dev/null +++ b/_analyzers/language-analyzers/italian.md @@ -0,0 +1,148 @@ +--- +layout: default +title: Italian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 220 +--- + +# Italian analyzer + +The built-in `italian` analyzer can be applied to a text field using the following command: + +```json +PUT /italian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "italian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_italian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_italian_analyzer": { + "type": "italian", + "stem_exclusion": ["autorità", "approvazione"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Italian analyzer internals + +The `italian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - elision (Italian) + - lowercase + - stop (Italian) + - keyword + - stemmer (Italian) + +## Custom Italian analyzer + +You can create a custom Italian analyzer using the following command: + +```json +PUT /italian-index +{ + "settings": { + "analysis": { + "filter": { + "italian_stop": { + "type": "stop", + "stopwords": "_italian_" + }, + "italian_elision": { + "type": "elision", + "articles": [ + "c", "l", "all", "dall", "dell", + "nell", "sull", "coll", "pell", + "gl", "agl", "dagl", "degl", "negl", + "sugl", "un", "m", "t", "s", "v", "d" + ], + "articles_case": true + }, + "italian_stemmer": { + "type": "stemmer", + "language": "light_italian" + }, + "italian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "italian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "italian_elision", + "lowercase", + "italian_stop", + "italian_keywords", + "italian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "italian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /italian-index/_analyze +{ + "field": "content", + "text": "Gli studenti studiano nelle università italiane. I loro numeri sono 123456." 
+}
+```
+{% include copy-curl.html %}
+
+The response contains the generated tokens:
+
+```json
+{
+  "tokens": [
+    {"token": "student","start_offset": 4,"end_offset": 12,"type": "","position": 1},
+    {"token": "studian","start_offset": 13,"end_offset": 21,"type": "","position": 2},
+    {"token": "universit","start_offset": 28,"end_offset": 38,"type": "","position": 4},
+    {"token": "italian","start_offset": 39,"end_offset": 47,"type": "","position": 5},
+    {"token": "numer","start_offset": 56,"end_offset": 62,"type": "","position": 8},
+    {"token": "123456","start_offset": 68,"end_offset": 74,"type": "","position": 10}
+  ]
+}
+```
\ No newline at end of file
diff --git a/_analyzers/language-analyzers/latvian.md b/_analyzers/language-analyzers/latvian.md
new file mode 100644
index 0000000000..2301759763
--- /dev/null
+++ b/_analyzers/language-analyzers/latvian.md
@@ -0,0 +1,148 @@
+---
+layout: default
+title: Latvian
+parent: Language analyzers
+grand_parent: Analyzers
+nav_order: 230
+---
+
+# Latvian analyzer
+
+The built-in `latvian` analyzer can be applied to a text field using the following command:
+
+```json
+PUT /latvian-index
+{
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "latvian"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Stem exclusion
+
+You can use `stem_exclusion` with this language analyzer using the following command:
+
+```json
+PUT index_with_stem_exclusion_latvian_analyzer
+{
+  "settings": {
+    "analysis": {
+      "analyzer": {
+        "stem_exclusion_latvian_analyzer": {
+          "type": "latvian",
+          "stem_exclusion": ["autoritāte", "apstiprinājums"]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Latvian analyzer internals
+
+The `latvian` analyzer is built using the following components:
+
+- Tokenizer: `standard`
+
+- Token filters:
+  - lowercase
+  - stop (Latvian)
+  - keyword
+  - stemmer (Latvian)
+
+## Custom Latvian analyzer
+
+You can create a custom Latvian analyzer using the following command:
+
+```json
+PUT /latvian-index
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "latvian_stop": {
+          "type": "stop",
+          "stopwords": "_latvian_"
+        },
+        "latvian_stemmer": {
+          "type": "stemmer",
+          "language": "latvian"
+        },
+        "latvian_keywords": {
+          "type": "keyword_marker",
+          "keywords": []
+        }
+      },
+      "analyzer": {
+        "latvian_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "filter": [
+            "lowercase",
+            "latvian_stop",
+            "latvian_keywords",
+            "latvian_stemmer"
+          ]
+        }
+      }
+    }
+  },
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "latvian_analyzer"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Generated tokens
+
+Use the following request to examine the tokens generated using the analyzer:
+
+```json
+POST /latvian-index/_analyze
+{
+  "field": "content",
+  "text": "Studenti mācās Latvijas universitātēs. Viņu numuri ir 123456."
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "student","start_offset": 0,"end_offset": 8,"type": "","position": 0}, + {"token": "māc","start_offset": 9,"end_offset": 14,"type": "","position": 1}, + {"token": "latvij","start_offset": 15,"end_offset": 23,"type": "","position": 2}, + {"token": "universitāt","start_offset": 24,"end_offset": 37,"type": "","position": 3}, + {"token": "vin","start_offset": 39,"end_offset": 43,"type": "","position": 4}, + {"token": "numur","start_offset": 44,"end_offset": 50,"type": "","position": 5}, + {"token": "123456","start_offset": 54,"end_offset": 60,"type": "","position": 7} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/lithuanian.md b/_analyzers/language-analyzers/lithuanian.md new file mode 100644 index 0000000000..ca5966c54e --- /dev/null +++ b/_analyzers/language-analyzers/lithuanian.md @@ -0,0 +1,136 @@ +--- +layout: default +title: Lithuanian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 230 +--- + +# Lithuanian analyzer + +The built-in `lithuanian` analyzer can be applied to a text field using the following command: + +```json +PUT /lithuanian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "lithuanian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_lithuanian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_lithuanian_analyzer": { + "type": "lithuanian", + "stem_exclusion": ["autoritetas", "patvirtinimas"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Lithuanian analyzer internals + +The `lithuanian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Lithuanian) + - keyword + - stemmer (Lithuanian) + +## Custom Lithuanian analyzer + +You can create a custom Lithuanian analyzer using the following command: + +```json +PUT /lithuanian-index +{ + "settings": { + "analysis": { + "filter": { + "lithuanian_stop": { + "type": "stop", + "stopwords": "_lithuanian_" + }, + "lithuanian_stemmer": { + "type": "stemmer", + "language": "lithuanian" + }, + "lithuanian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "lithuanian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "lithuanian_stop", + "lithuanian_keywords", + "lithuanian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "lithuanian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /lithuanian-index/_analyze +{ + "field": "content", + "text": "Studentai mokosi Lietuvos universitetuose. Jų numeriai yra 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "student","start_offset": 0,"end_offset": 9,"type": "","position": 0}, + {"token": "mok","start_offset": 10,"end_offset": 16,"type": "","position": 1}, + {"token": "lietuv","start_offset": 17,"end_offset": 25,"type": "","position": 2}, + {"token": "universitet","start_offset": 26,"end_offset": 41,"type": "","position": 3}, + {"token": "num","start_offset": 46,"end_offset": 54,"type": "","position": 5}, + {"token": "123456","start_offset": 59,"end_offset": 65,"type": "","position": 7} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/norwegian.md b/_analyzers/language-analyzers/norwegian.md new file mode 100644 index 0000000000..cfb04eebf3 --- /dev/null +++ b/_analyzers/language-analyzers/norwegian.md @@ -0,0 +1,137 @@ +--- +layout: default +title: Norwegian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 240 +--- + +# Norwegian analyzer + +The built-in `norwegian` analyzer can be applied to a text field using the following command: + +```json +PUT /norwegian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "norwegian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_norwegian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_norwegian_analyzer": { + "type": "norwegian", + "stem_exclusion": ["autoritet", "godkjenning"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Norwegian analyzer internals + +The `norwegian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Norwegian) + - keyword + - stemmer (Norwegian) + +## Custom Norwegian analyzer + +You can create a custom Norwegian analyzer using the following command: + +```json +PUT /norwegian-index +{ + "settings": { + "analysis": { + "filter": { + "norwegian_stop": { + "type": "stop", + "stopwords": "_norwegian_" + }, + "norwegian_stemmer": { + "type": "stemmer", + "language": "norwegian" + }, + "norwegian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "norwegian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "norwegian_stop", + "norwegian_keywords", + "norwegian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "norwegian_analyzer" + } + } + } +} + +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /norwegian-index/_analyze +{ + "field": "content", + "text": "Studentene studerer ved norske universiteter. Deres nummer er 123456." 
+}
+```
+{% include copy-curl.html %}
+
+The response contains the generated tokens:
+
+```json
+{
+  "tokens": [
+    {"token": "student","start_offset": 0,"end_offset": 10,"type": "","position": 0},
+    {"token": "studer","start_offset": 11,"end_offset": 19,"type": "","position": 1},
+    {"token": "norsk","start_offset": 24,"end_offset": 30,"type": "","position": 3},
+    {"token": "universitet","start_offset": 31,"end_offset": 44,"type": "","position": 4},
+    {"token": "numm","start_offset": 52,"end_offset": 58,"type": "","position": 6},
+    {"token": "123456","start_offset": 62,"end_offset": 68,"type": "","position": 8}
+  ]
+}
+```
\ No newline at end of file
diff --git a/_analyzers/language-analyzers/persian.md b/_analyzers/language-analyzers/persian.md
new file mode 100644
index 0000000000..40b38656fd
--- /dev/null
+++ b/_analyzers/language-analyzers/persian.md
@@ -0,0 +1,144 @@
+---
+layout: default
+title: Persian
+parent: Language analyzers
+grand_parent: Analyzers
+nav_order: 250
+---
+
+# Persian analyzer
+
+The built-in `persian` analyzer can be applied to a text field using the following command:
+
+```json
+PUT /persian-index
+{
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "persian"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Stem exclusion
+
+You can use `stem_exclusion` with this language analyzer using the following command:
+
+```json
+PUT index_with_stem_exclusion_persian_analyzer
+{
+  "settings": {
+    "analysis": {
+      "analyzer": {
+        "stem_exclusion_persian_analyzer": {
+          "type": "persian",
+          "stem_exclusion": ["حکومت", "تأیید"]
+        }
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Persian analyzer internals
+
+The `persian` analyzer is built using the following components:
+
+- Tokenizer: `standard`
+
+- Char filter: `mapping`
+
+- Token filters:
+  - lowercase
+  - decimal_digit
+  - normalization (Arabic)
+  - normalization (Persian)
+  - keyword
+  - stop (Persian)
+
+## Custom Persian analyzer
+
+You can create a custom Persian analyzer using the following command:
+
+```json
+PUT /persian-index
+{
+  "settings": {
+    "analysis": {
+      "filter": {
+        "persian_stop": {
+          "type": "stop",
+          "stopwords": "_persian_"
+        },
+        "persian_keywords": {
+          "type": "keyword_marker",
+          "keywords": []
+        }
+      },
+      "char_filter": {
+        "null_width_replace_with_space": {
+          "type": "mapping",
+          "mappings": [ "\\u200C=>\\u0020"]
+        }
+      },
+      "analyzer": {
+        "persian_analyzer": {
+          "type": "custom",
+          "tokenizer": "standard",
+          "char_filter": [ "null_width_replace_with_space" ],
+          "filter": [
+            "lowercase",
+            "decimal_digit",
+            "arabic_normalization",
+            "persian_normalization",
+            "persian_stop"
+          ]
+        }
+      }
+    }
+  },
+  "mappings": {
+    "properties": {
+      "content": {
+        "type": "text",
+        "analyzer": "persian_analyzer"
+      }
+    }
+  }
+}
+```
+{% include copy-curl.html %}
+
+## Generated tokens
+
+Use the following request to examine the tokens generated using the analyzer:
+
+```json
+POST /persian-index/_analyze
+{
+  "field": "content",
+  "text": "دانشجویان در دانشگاه‌های ایرانی تحصیل می‌کنند. شماره‌های آن‌ها ۱۲۳۴۵۶ است."
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "دانشجويان","start_offset": 0,"end_offset": 9,"type": "","position": 0}, + {"token": "دانشگاه","start_offset": 13,"end_offset": 20,"type": "","position": 2}, + {"token": "ايراني","start_offset": 25,"end_offset": 31,"type": "","position": 4}, + {"token": "تحصيل","start_offset": 32,"end_offset": 37,"type": "","position": 5}, + {"token": "شماره","start_offset": 47,"end_offset": 52,"type": "","position": 8}, + {"token": "123456","start_offset": 63,"end_offset": 69,"type": "","position": 12} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/portuguese.md b/_analyzers/language-analyzers/portuguese.md new file mode 100644 index 0000000000..166ffa0010 --- /dev/null +++ b/_analyzers/language-analyzers/portuguese.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Portuguese +parent: Language analyzers +grand_parent: Analyzers +nav_order: 260 +--- + +# Portuguese analyzer + +The built-in `portuguese` analyzer can be applied to a text field using the following command: + +```json +PUT /portuguese-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "portuguese" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_portuguese_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_portuguese_analyzer": { + "type": "portuguese", + "stem_exclusion": ["autoridade", "aprovação"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Portuguese analyzer internals + +The `portuguese` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Portuguese) + - keyword + - stemmer (Portuguese) + +## Custom Portuguese analyzer + +You can create a custom Portuguese analyzer using the following command: + +```json +PUT /portuguese-index +{ + "settings": { + "analysis": { + "filter": { + "portuguese_stop": { + "type": "stop", + "stopwords": "_portuguese_" + }, + "portuguese_stemmer": { + "type": "stemmer", + "language": "light_portuguese" + }, + "portuguese_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "portuguese_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "portuguese_stop", + "portuguese_keywords", + "portuguese_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "portuguese_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /portuguese-index/_analyze +{ + "field": "content", + "text": "Os estudantes estudam nas universidades brasileiras. Seus números são 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "estudant", + "start_offset": 3, + "end_offset": 13, + "type": "", + "position": 1 + }, + { + "token": "estudam", + "start_offset": 14, + "end_offset": 21, + "type": "", + "position": 2 + }, + { + "token": "universidad", + "start_offset": 26, + "end_offset": 39, + "type": "", + "position": 4 + }, + { + "token": "brasileir", + "start_offset": 40, + "end_offset": 51, + "type": "", + "position": 5 + }, + { + "token": "numer", + "start_offset": 58, + "end_offset": 65, + "type": "", + "position": 7 + }, + { + "token": "123456", + "start_offset": 70, + "end_offset": 76, + "type": "", + "position": 9 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/romanian.md b/_analyzers/language-analyzers/romanian.md new file mode 100644 index 0000000000..cad0953385 --- /dev/null +++ b/_analyzers/language-analyzers/romanian.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Romanian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 270 +--- + +# Romanian analyzer + +The built-in `romanian` analyzer can be applied to a text field using the following command: + +```json +PUT /romanian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "romanian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_romanian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_romanian_analyzer": { + "type": "romanian", + "stem_exclusion": ["autoritate", "aprobat"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Romanian analyzer internals + +The `romanian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Romanian) + - keyword + - stemmer (Romanian) + +## Custom Romanian analyzer + +You can create a custom Romanian analyzer using the following command: + +```json +PUT /romanian-index +{ + "settings": { + "analysis": { + "filter": { + "romanian_stop": { + "type": "stop", + "stopwords": "_romanian_" + }, + "romanian_stemmer": { + "type": "stemmer", + "language": "romanian" + }, + "romanian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "romanian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "romanian_stop", + "romanian_keywords", + "romanian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "romanian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /romanian-index/_analyze +{ + "field": "content", + "text": "Studenții învață la universitățile din România. Numerele lor sunt 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "studenț", + "start_offset": 0, + "end_offset": 9, + "type": "", + "position": 0 + }, + { + "token": "învaț", + "start_offset": 10, + "end_offset": 16, + "type": "", + "position": 1 + }, + { + "token": "universităț", + "start_offset": 20, + "end_offset": 34, + "type": "", + "position": 3 + }, + { + "token": "român", + "start_offset": 39, + "end_offset": 46, + "type": "", + "position": 5 + }, + { + "token": "numer", + "start_offset": 48, + "end_offset": 56, + "type": "", + "position": 6 + }, + { + "token": "123456", + "start_offset": 66, + "end_offset": 72, + "type": "", + "position": 9 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/russian.md b/_analyzers/language-analyzers/russian.md new file mode 100644 index 0000000000..bd57ba0b27 --- /dev/null +++ b/_analyzers/language-analyzers/russian.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Russian +parent: Language analyzers +grand_parent: Analyzers +nav_order: 280 +--- + +# Russian analyzer + +The built-in `russian` analyzer can be applied to a text field using the following command: + +```json +PUT /russian-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "russian" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_russian_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_russian_analyzer": { + "type": "russian", + "stem_exclusion": ["авторитет", "одобрение"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Russian analyzer internals + +The `russian` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Russian) + - keyword + - stemmer (Russian) + +## Custom Russian analyzer + +You can create a custom Russian analyzer using the following command: + +```json +PUT /russian-index +{ + "settings": { + "analysis": { + "filter": { + "russian_stop": { + "type": "stop", + "stopwords": "_russian_" + }, + "russian_stemmer": { + "type": "stemmer", + "language": "russian" + }, + "russian_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "russian_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "russian_stop", + "russian_keywords", + "russian_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "russian_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /russian-index/_analyze +{ + "field": "content", + "text": "Студенты учатся в университетах России. Их номера 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "студент", + "start_offset": 0, + "end_offset": 8, + "type": "", + "position": 0 + }, + { + "token": "учат", + "start_offset": 9, + "end_offset": 15, + "type": "", + "position": 1 + }, + { + "token": "университет", + "start_offset": 18, + "end_offset": 31, + "type": "", + "position": 3 + }, + { + "token": "росс", + "start_offset": 32, + "end_offset": 38, + "type": "", + "position": 4 + }, + { + "token": "номер", + "start_offset": 43, + "end_offset": 49, + "type": "", + "position": 6 + }, + { + "token": "123456", + "start_offset": 50, + "end_offset": 56, + "type": "", + "position": 7 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/sorani.md b/_analyzers/language-analyzers/sorani.md new file mode 100644 index 0000000000..f71d43c481 --- /dev/null +++ b/_analyzers/language-analyzers/sorani.md @@ -0,0 +1,168 @@ +--- +layout: default +title: Sorani +parent: Language analyzers +grand_parent: Analyzers +nav_order: 290 +--- + +# Sorani analyzer + +The built-in `sorani` analyzer can be applied to a text field using the following command: + +```json +PUT /sorani-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "sorani" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_sorani_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_sorani_analyzer": { + "type": "sorani", + "stem_exclusion": ["مؤسسه", "اجازه"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Sorani analyzer internals + +The `sorani` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - normalization (Sorani) + - lowercase + - decimal_digit + - stop (Sorani) + - keyword + - stemmer (Sorani) + +## Custom Sorani analyzer + +You can create a custom Sorani analyzer using the following command: + +```json +PUT /sorani-index +{ + "settings": { + "analysis": { + "filter": { + "sorani_stop": { + "type": "stop", + "stopwords": "_sorani_" + }, + "sorani_stemmer": { + "type": "stemmer", + "language": "sorani" + }, + "sorani_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "sorani_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "decimal_digit", + "sorani_stop", + "sorani_keywords", + "sorani_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "sorani_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /sorani-index/_analyze +{ + "field": "content", + "text": "خوێندنی فەرمی لە هەولێرەوە. ژمارەکان ١٢٣٤٥٦." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "خوێندن", + "start_offset": 0, + "end_offset": 7, + "type": "", + "position": 0 + }, + { + "token": "فەرم", + "start_offset": 8, + "end_offset": 13, + "type": "", + "position": 1 + }, + { + "token": "هەولێر", + "start_offset": 17, + "end_offset": 26, + "type": "", + "position": 3 + }, + { + "token": "ژمار", + "start_offset": 28, + "end_offset": 36, + "type": "", + "position": 4 + }, + { + "token": "123456", + "start_offset": 37, + "end_offset": 43, + "type": "", + "position": 5 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/spanish.md b/_analyzers/language-analyzers/spanish.md new file mode 100644 index 0000000000..8a0d8fad3c --- /dev/null +++ b/_analyzers/language-analyzers/spanish.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Spanish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 300 +--- + +# Spanish analyzer + +The built-in `spanish` analyzer can be applied to a text field using the following command: + +```json +PUT /spanish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "spanish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_spanish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_spanish_analyzer": { + "type": "spanish", + "stem_exclusion": ["autoridad", "aprobación"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Spanish analyzer internals + +The `spanish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Spanish) + - keyword + - stemmer (Spanish) + +## Custom Spanish analyzer + +You can create a custom Spanish analyzer using the following command: + +```json +PUT /spanish-index +{ + "settings": { + "analysis": { + "filter": { + "spanish_stop": { + "type": "stop", + "stopwords": "_spanish_" + }, + "spanish_stemmer": { + "type": "stemmer", + "language": "light_spanish" + }, + "spanish_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "spanish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "spanish_stop", + "spanish_keywords", + "spanish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "spanish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /spanish-index/_analyze +{ + "field": "content", + "text": "Los estudiantes estudian en universidades españolas. Sus números son 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "estudiant", + "start_offset": 4, + "end_offset": 15, + "type": "", + "position": 1 + }, + { + "token": "estudian", + "start_offset": 16, + "end_offset": 24, + "type": "", + "position": 2 + }, + { + "token": "universidad", + "start_offset": 28, + "end_offset": 41, + "type": "", + "position": 4 + }, + { + "token": "español", + "start_offset": 42, + "end_offset": 51, + "type": "", + "position": 5 + }, + { + "token": "numer", + "start_offset": 57, + "end_offset": 64, + "type": "", + "position": 7 + }, + { + "token": "123456", + "start_offset": 69, + "end_offset": 75, + "type": "", + "position": 9 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/swedish.md b/_analyzers/language-analyzers/swedish.md new file mode 100644 index 0000000000..9da595f12e --- /dev/null +++ b/_analyzers/language-analyzers/swedish.md @@ -0,0 +1,172 @@ +--- +layout: default +title: Swedish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 310 +--- + +# Swedish analyzer + +The built-in `swedish` analyzer can be applied to a text field using the following command: + +```json +PUT /swedish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "swedish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_swedish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_swedish_analyzer": { + "type": "swedish", + "stem_exclusion": ["myndighet", "godkännande"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Swedish analyzer internals + +The `swedish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - lowercase + - stop (Swedish) + - keyword + - stemmer (Swedish) + +## Custom Swedish analyzer + +You can create a custom Swedish analyzer using the following command: + +```json +PUT /swedish-index +{ + "settings": { + "analysis": { + "filter": { + "swedish_stop": { + "type": "stop", + "stopwords": "_swedish_" + }, + "swedish_stemmer": { + "type": "stemmer", + "language": "swedish" + }, + "swedish_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "swedish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "lowercase", + "swedish_stop", + "swedish_keywords", + "swedish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "swedish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /swedish-index/_analyze +{ + "field": "content", + "text": "Studenter studerar vid svenska universitet. Deras nummer är 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + { + "token": "student", + "start_offset": 0, + "end_offset": 9, + "type": "", + "position": 0 + }, + { + "token": "studer", + "start_offset": 10, + "end_offset": 18, + "type": "", + "position": 1 + }, + { + "token": "svensk", + "start_offset": 23, + "end_offset": 30, + "type": "", + "position": 3 + }, + { + "token": "universitet", + "start_offset": 31, + "end_offset": 42, + "type": "", + "position": 4 + }, + { + "token": "numm", + "start_offset": 50, + "end_offset": 56, + "type": "", + "position": 6 + }, + { + "token": "123456", + "start_offset": 60, + "end_offset": 66, + "type": "", + "position": 8 + } + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/thai.md b/_analyzers/language-analyzers/thai.md new file mode 100644 index 0000000000..e4daa1f0be --- /dev/null +++ b/_analyzers/language-analyzers/thai.md @@ -0,0 +1,132 @@ +--- +layout: default +title: Thai +parent: Language analyzers +grand_parent: Analyzers +nav_order: 320 +--- + +# Thai analyzer + +The built-in `thai` analyzer can be applied to a text field using the following command: + +```json +PUT /thai-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "thai" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_thai_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_thai_analyzer": { + "type": "thai", + "stem_exclusion": ["อำนาจ", "การอนุมัติ"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Thai analyzer internals + +The `thai` analyzer is built using the following components: + +- Tokenizer: `thai` + +- Token filters: + - lowercase + - decimal_digit + - stop (Thai) + - keyword + +## Custom Thai analyzer + +You can create a custom Thai analyzer using the following command: + +```json +PUT /thai-index +{ + "settings": { + "analysis": { + "filter": { + "thai_stop": { + "type": "stop", + "stopwords": "_thai_" + }, + "thai_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "thai_analyzer": { + "tokenizer": "thai", + "filter": [ + "lowercase", + "decimal_digit", + "thai_stop", + "thai_keywords" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "thai_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /thai-index/_analyze +{ + "field": "content", + "text": "นักเรียนกำลังศึกษาอยู่ที่มหาวิทยาลัยไทย หมายเลข 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "นักเรียน","start_offset": 0,"end_offset": 8,"type": "word","position": 0}, + {"token": "กำลัง","start_offset": 8,"end_offset": 13,"type": "word","position": 1}, + {"token": "ศึกษา","start_offset": 13,"end_offset": 18,"type": "word","position": 2}, + {"token": "มหาวิทยาลัย","start_offset": 25,"end_offset": 36,"type": "word","position": 5}, + {"token": "ไทย","start_offset": 36,"end_offset": 39,"type": "word","position": 6}, + {"token": "หมายเลข","start_offset": 40,"end_offset": 47,"type": "word","position": 7}, + {"token": "123456","start_offset": 48,"end_offset": 54,"type": "word","position": 8} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/language-analyzers/turkish.md b/_analyzers/language-analyzers/turkish.md new file mode 100644 index 0000000000..fb36c5413c --- /dev/null +++ b/_analyzers/language-analyzers/turkish.md @@ -0,0 +1,143 @@ +--- +layout: default +title: Turkish +parent: Language analyzers +grand_parent: Analyzers +nav_order: 330 +--- + +# Turkish analyzer + +The built-in `turkish` analyzer can be applied to a text field using the following command: + +```json +PUT /turkish-index +{ + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "turkish" + } + } + } +} +``` +{% include copy-curl.html %} + +## Stem exclusion + +You can use `stem_exclusion` with this language analyzer using the following command: + +```json +PUT index_with_stem_exclusion_turkish_analyzer +{ + "settings": { + "analysis": { + "analyzer": { + "stem_exclusion_turkish_analyzer": { + "type": "turkish", + "stem_exclusion": ["otorite", "onay"] + } + } + } + } +} +``` +{% include copy-curl.html %} + +## Turkish analyzer internals + +The `turkish` analyzer is built using the following components: + +- Tokenizer: `standard` + +- Token filters: + - apostrophe + - lowercase (Turkish) + - stop (Turkish) + - keyword + - stemmer (Turkish) + +## Custom Turkish analyzer + +You can create a custom Turkish analyzer using the following command: + +```json +PUT /turkish-index +{ + "settings": { + "analysis": { + "filter": { + "turkish_stop": { + "type": "stop", + "stopwords": "_turkish_" + }, + "turkish_stemmer": { + "type": "stemmer", + "language": "turkish" + }, + "turkish_lowercase": { + "type": "lowercase", + "language": "turkish" + }, + "turkish_keywords": { + "type": "keyword_marker", + "keywords": [] + } + }, + "analyzer": { + "turkish_analyzer": { + "type": "custom", + "tokenizer": "standard", + "filter": [ + "apostrophe", + "turkish_lowercase", + "turkish_stop", + "turkish_keywords", + "turkish_stemmer" + ] + } + } + } + }, + "mappings": { + "properties": { + "content": { + "type": "text", + "analyzer": "turkish_analyzer" + } + } + } +} +``` +{% include copy-curl.html %} + +## Generated tokens + +Use the following request to examine the tokens generated using the analyzer: + +```json +POST /turkish-index/_analyze +{ + "field": "content", + "text": "Öğrenciler Türk üniversitelerinde öğrenim görüyor. Numara 123456." 
+} +``` +{% include copy-curl.html %} + +The response contains the generated tokens: + +```json +{ + "tokens": [ + {"token": "öğrenci","start_offset": 0,"end_offset": 10,"type": "","position": 0}, + {"token": "türk","start_offset": 11,"end_offset": 15,"type": "","position": 1}, + {"token": "üniversite","start_offset": 16,"end_offset": 33,"type": "","position": 2}, + {"token": "öğre","start_offset": 34,"end_offset": 41,"type": "","position": 3}, + {"token": "görüyor","start_offset": 42,"end_offset": 49,"type": "","position": 4}, + {"token": "numar","start_offset": 51,"end_offset": 57,"type": "","position": 5}, + {"token": "123456","start_offset": 58,"end_offset": 64,"type": "","position": 6} + ] +} +``` \ No newline at end of file diff --git a/_analyzers/supported-analyzers/index.md b/_analyzers/supported-analyzers/index.md index 5616936179..43e41b8d6a 100644 --- a/_analyzers/supported-analyzers/index.md +++ b/_analyzers/supported-analyzers/index.md @@ -24,12 +24,12 @@ Analyzer | Analysis performed | Analyzer output **Stop** | - Parses strings into tokens on any non-letter character
- Removes non-letter characters<br>
- Removes stop words<br>
- Converts tokens to lowercase | [`s`, `fun`, `contribute`, `brand`, `new`, `pr`, `opensearch`] **Keyword** (no-op) | - Outputs the entire string unchanged | [`It’s fun to contribute a brand-new PR or 2 to OpenSearch!`] **Pattern** | - Parses strings into tokens using regular expressions<br>
- Supports converting strings to lowercase<br>
- Supports removing stop words | [`it`, `s`, `fun`, `to`, `contribute`, `a`,`brand`, `new`, `pr`, `or`, `2`, `to`, `opensearch`] -[**Language**]({{site.url}}{{site.baseurl}}/analyzers/language-analyzers/) | Performs analysis specific to a certain language (for example, `english`). | [`fun`, `contribut`, `brand`, `new`, `pr`, `2`, `opensearch`] +[**Language**]({{site.url}}{{site.baseurl}}/analyzers/language-analyzers/index/) | Performs analysis specific to a certain language (for example, `english`). | [`fun`, `contribut`, `brand`, `new`, `pr`, `2`, `opensearch`] **Fingerprint** | - Parses strings on any non-letter character<br>
- Normalizes characters by converting them to ASCII<br>
- Converts tokens to lowercase<br>
- Sorts, deduplicates, and concatenates tokens into a single token<br>
- Supports removing stop words | [`2 a brand contribute fun it's new opensearch or pr to`]<br>
Note that the apostrophe was converted to its ASCII counterpart. ## Language analyzers -OpenSearch supports multiple language analyzers. For more information, see [Language analyzers]({{site.url}}{{site.baseurl}}/analyzers/language-analyzers/). +OpenSearch supports multiple language analyzers. For more information, see [Language analyzers]({{site.url}}{{site.baseurl}}/analyzers/language-analyzers/index). ## Additional analyzers