elixir-unicode
diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
Lines changed: 4 additions & 10 deletions
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 11 additions & 3 deletions
diff --git a/README.md b/README.md
Lines changed: 86 additions & 10 deletions
diff --git a/lib/unicode/break.ex b/lib/unicode/break.ex
Lines changed: 1 addition & 0 deletions
diff --git a/lib/unicode/segment.ex b/lib/unicode/segment.ex
Lines changed: 1 addition & 1 deletion
@@ -8,22 +8,19 @@ on:
 
 jobs:
   test:
-    runs-on: ubuntu-18.04
+    runs-on: ubuntu-20.04
     env:
       MIX_ENV: test
     strategy:
       fail-fast: false
       matrix:
         include:
-          - pair:
-              elixir: 1.8.2
-              otp: 20.3.8.26
           - pair:
               elixir: 1.14.0
               otp: 25.1
             lint: lint
     steps:
-      - uses: actions/checkout@v2
+      - uses: actions/checkout@v3
 
       - uses: erlef/setup-beam@v1
         with:
@@ -42,12 +39,9 @@ jobs:
       - run: mix format --check-formatted
         if: ${{ matrix.lint }}
 
-      - run: mix deps.unlock --check-unused
-        if: ${{ matrix.lint }}
-
-      - run: mix credo --strict check
+      - run: mix deps --check-unused
         if: ${{ matrix.lint }}
-
+        
       - run: mix deps.compile
 
       - run: mix compile --warnings-as-errors
 
@@ -1,13 +1,21 @@
 # Changelog
 
-## Unicode String v1.1.1
+## Unicode String v1.2.1
 
-This is the changelog for Unicode String v1.1.1 released on June 2nd, 2023.  For older changelogs please consult the release tag on [GitHub](https://github.com/elixir-unicode/unicode_string/tags)
+This is the changelog for Unicode String v1.2.1 released on June 2nd, 2023.  For older changelogs please consult the release tag on [GitHub](https://github.com/elixir-unicode/unicode_string/tags)
 
 ### Bug Fixes
 
 * Resolve segments dir at runtime, not compile time. Thanks to @crkent for the report. Closes #4.
 
+## Unicode String v1.2.0
+
+This is the changelog for Unicode String v1.2.0 released on March 14th, 2023.  For older changelogs please consult the release tag on [GitHub](https://github.com/elixir-unicode/unicode_string/tags)
+
+### Enhancements
+
+* Adds `Unicode.String.stream/2` to support streaming graphemes, words, sentences and line breaks.
+
 ## Unicode String v1.1.0
 
 This is the changelog for Unicode String v1.1.0 released on September 21st, 2022.  For older changelogs please consult the release tag on [GitHub](https://github.com/elixir-unicode/unicode_string/tags)
@@ -46,7 +54,7 @@ This is the changelog for Unicode String v0.2.0 released on July 12th, 2020.  Fo
 
 ### Enhancements
 
-This release implements the Unicode break rules for graphemes, words, lines and sentences.
+This release implements the Unicode break rules for graphemes, words, lines (word-wrapping) and sentences.
 
 * Adds `Unicode.String.split/2`
 
 
@@ -2,24 +2,100 @@
 
 Adds functions supporting some string algorithms in the Unicode standard. For example:
 
-* `Unicode.String.fold/1,2` that applies the [Unicode Case Folding algorithm](https://www.unicode.org/versions/Unicode14.0.0/ch03.pdf)
+* The [Unicode Case Folding](https://www.unicode.org/versions/Unicode15.0.0/ch03.pdf) algorithm to provide case-independent equality checking irrespective of language or script with `Unicode.String.fold/2` and `Unicode.String.equals_ignoring_case?/2`
 
-* `Unicode.String.equals_ignoring_case?/2` that compares two strings for equality after applying `Unicode.String.fold/2` to the arguments.
+* The [Unicode Segmentation](https://unicode.org/reports/tr29/) algorithm to detect, break, split or stream strings into grapheme clusters, words, sentences and line break points.
 
-## Examples
+* The [Unicode Line Breaking](https://www.unicode.org/reports/tr14/) algorithm to determine line breaks (as in breaks where word-wrapping would be acceptable).
 
-		iex> Unicode.String.equals_ignoring_case? "ABC", "abc"
-		true
+## Casing
 
-		iex> Unicode.String.equals_ignoring_case? "beißen", "beissen"
-		true
+The [Unicode Case Folding](https://www.unicode.org/versions/Unicode15.0.0/ch03.pdf) algorithm defines how to perform case folding. This allows strings to be compared in a case-insensitive fashion. It does not define a means of comparing strings while ignoring diacritical marks (accents). Some examples follow; for details see:
 
-		iex> Unicode.String.equals_ignoring_case? "grüßen", "grussen"
-		false
+* `Unicode.String.fold/2`
+* `Unicode.String.equals_ignoring_case?/3`
+
+```elixir
+iex> Unicode.String.equals_ignoring_case? "ABC", "abc"
+true
+
+iex> Unicode.String.equals_ignoring_case? "beißen", "beissen"
+true
+
+iex> Unicode.String.equals_ignoring_case? "grüßen", "grussen"
+false
+```
+## Segmentation
+
+The [Unicode Segmentation](https://unicode.org/reports/tr29/) annex details the algorithm to be applied when segmenting text (Elixir strings) into words, sentences, graphemes and line breaks. Some examples follow; for details see:
+
+* `Unicode.String.split/2`
+* `Unicode.String.break?/2`
+* `Unicode.String.break/2`
+* `Unicode.String.splitter/2`
+* `Unicode.String.next/2`
+* `Unicode.String.stream/2`
+
+```elixir
+# Split text at a word boundary.
+iex> Unicode.String.split "This is a sentence. And another.", break: :word
+["This", " ", "is", " ", "a", " ", "sentence", ".", " ", "And", " ", "another", "."]
+
+# Split text at a word boundary but omit any whitespace
+iex> Unicode.String.split "This is a sentence. And another.", break: :word, trim: true
+["This", "is", "a", "sentence", ".", "And", "another", "."]
+
+# Split text at a sentence boundary.
+iex> Unicode.String.split "This is a sentence. And another.", break: :sentence
+["This is a sentence. ", "And another."]
+
+# By default, common abbreviations are suppressed (i.e.
+# they do not cause a break)
+iex> Unicode.String.split "No, I don't have a Ph.D. but I don't think it matters.", break: :word, trim: true
+["No", ",", "I", "don't", "have", "a", "Ph.D", ".", "but", "I", "don't",
+ "think", "it", "matters", "."]
+
+iex> Unicode.String.split "No, I don't have a Ph.D. but I don't think it matters.", break: :sentence, trim: true
+["No, I don't have a Ph.D. but I don't think it matters."]
+
+# Sentence Break suppressions are locale sensitive.
+iex> Unicode.String.Segment.known_locales
+["de", "el", "en", "en-US", "en-US-POSIX", "es", "fi", "fr", "it", "ja", "pt",
+ "root", "ru", "sv", "zh", "zh-Hant"]
+
+iex> Unicode.String.split "Non, c'est M. Dubois.", break: :sentence, trim: true, locale: "fr"
+["Non, c'est M. Dubois."]
+
+# Note that break: :line does NOT mean split the string
+# at newlines. It splits the string where a line break would be
+# acceptable. This is very useful for calculating where
+# to perform word-wrap on some text.
+iex> Unicode.String.split "This is a sentence. And another.", break: :line
+["This ", "is ", "a ", "sentence. ", "And ", "another."]
+```
+
+## Segment Streaming
+
+Segmentation can also be streamed using `Unicode.String.stream/2`. For large strings this may improve memory usage since the intermediate segments will be garbage collected when they fall out of scope.
+
+```elixir
+iex> Enum.to_list Unicode.String.stream("this is a set of words", trim: true)
+["this", "is", "a", "set", "of", "words"]
+
+iex> Enum.map Unicode.String.stream("this is a set of words", trim: true),
+...>   fn word -> %{word: word, length: String.length(word)} end
+[
+  %{length: 4, word: "this"},
+  %{length: 2, word: "is"},
+  %{length: 1, word: "a"},
+  %{length: 3, word: "set"},
+  %{length: 2, word: "of"},
+  %{length: 5, word: "words"}
+]
+```
 
 ## Installation
 
-The package can be installed by adding `unicode_string` to your list of dependencies in `mix.exs`:
+The package can be installed by adding `:unicode_string` to your list of dependencies in `mix.exs`:
 
 ```elixir
 def deps do
 
@@ -189,6 +189,7 @@ defmodule Unicode.String.Break do
   #      "S.A.", "Up.", "Job.", "Num.", "M.I.T.", "Ok.", "Org.", "Ex.", "Cont.", "U.",
   #      "Mart.", "Fn.", "Abs.", "Lt.", "OK.", "Z.", "E.", "Kb.", "Est.", "A.M.",
   #      "L.A.", ...]
+
   defp suppressions_rule(locale, segment_type)
 
   for locale <- Segment.known_locales() do
 
@@ -152,7 +152,7 @@ defmodule Unicode.String.Segment do
   end
 
   @doc """
-  Evaludates a list of rules against a given
+  Evaluates a list of rules against a given
   string.
 
   """