Access to the Unicode Character Database from Erlang

isvilen/unicodedata

Unicode support library

This Erlang library provides access to the character properties defined in the Unicode Character Database and implements the following algorithms specified in the Unicode 9.0.0 standard:

  • Default Case Algorithms specified in The Unicode Standard Section 3.13

    • Default Case Conversion
    • Default Case Folding
    • Default Caseless Matching
  • Unicode normalization specified in The Unicode Standard Annex #15

    • Normalization Form D
    • Normalization Form KD
    • Normalization Form C
    • Normalization Form KC
  • Unicode Text Segmentation specified in The Unicode Standard Annex #29

    • Grapheme Cluster Boundaries
    • Word Boundaries
    • Sentence Boundaries
  • Unicode Line Breaking Algorithm specified in The Unicode Standard Annex #14

  • Unicode Bidirectional Algorithm specified in The Unicode Standard Annex #9
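
To make a few of the algorithms above concrete, here is a minimal sketch of normalization, caseless matching, and grapheme cluster counting. It deliberately uses only the `unicode` and `string` modules from the Erlang/OTP standard library (OTP 20 or later), not this library, because this README does not list this library's function names. The module name `unicode_sketch` and the function `demo/0` are hypothetical; see the generated edoc for the actual API.

```erlang
%% Illustration only: uses OTP stdlib calls, NOT this library's API.
-module(unicode_sketch).
-export([demo/0]).

demo() ->
    %% "e" followed by U+0301 COMBINING ACUTE ACCENT (decomposed form).
    Decomposed = [$e, 16#0301],

    %% Normalization Form C composes the pair into U+00E9.
    [16#00E9] = unicode:characters_to_nfc_list(Decomposed),

    %% Normalization Form D decomposes U+00E9 back into two code points.
    Decomposed = unicode:characters_to_nfd_list([16#00E9]),

    %% Default caseless matching: compare the case-folded forms.
    %% U+00DF LATIN SMALL LETTER SHARP S folds to "ss".
    Strasse = [$S, $t, $r, $a, 16#00DF, $e],
    true = string:casefold(Strasse) =:= string:casefold("STRASSE"),

    %% Grapheme cluster boundaries: three code points, two clusters.
    2 = string:length(Decomposed ++ "x"),
    ok.
```

Calling `unicode_sketch:demo()` returns `ok` if every pattern match holds and raises a `badmatch` error otherwise.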

Build

$ rebar3 compile

Documentation

Run rebar3 edoc and open the generated doc/index.html file.
