97 lines
2.9 KiB
Plaintext
97 lines
2.9 KiB
Plaintext
URL: https://github.com/hyphenation/tex-hyphen
|
|
Version: ecf976ab6995acb653d38ab1af0b9b9829ec0c77
|
|
License: various
|
|
License File: NOTICE
|
|
|
|
The tex-hyphen package contains hyphenation patterns under several different licenses.
|
|
Patterns licensed under the following open source licenses can be imported here:
|
|
* Apache 2.0
|
|
* Unlicense
|
|
* MIT
|
|
* BSD
|
|
* MPL
|
|
* LGPL 2.1
|
|
|
|
Description:
|
|
Hyphenation patterns for automatic hyphenation from TeX hyphenation patterns
|
|
project at http://www.hyphenation.org.
|
|
|
|
A few languages are from other sources with the individual repositories as listed below:
|
|
Czech (cs) - https://github.com/tensojka/cshyphen
|
|
Dutch (nl) - https://github.com/OpenTaal/opentaal-hunspell
|
|
Swedish (sv) - https://cgit.freedesktop.org/libreoffice/dictionaries/tree/sv_SE/hyph_sv.dic
|
|
Russian (ru) - https://github.com/laboratory50/russian-spellpack
|
|
|
|
Local Modifications:
|
|
The patterns are taken from the hyph-utf8/tex/generic/hyph-utf8/patterns/txt directory.
|
|
|
|
Only the following patterns are included from that directory:
|
|
|
|
* Amharic (am)
|
|
* Armenian (hy)
|
|
* Assamese (as)
|
|
* Basque (eu)
|
|
* Belarusian (be)
|
|
* Bengali (bn)
|
|
* Bulgarian (bg)
|
|
* Church Slavonic (cu)
|
|
* Croation (hr)
|
|
* Danish (da)
|
|
* Estonian (et)
|
|
* Ethiopic script (und-Ethi)
|
|
* French (fr)
|
|
* Galician (gl)
|
|
* German (de)
|
|
* Gujarati (gu)
|
|
* Hindi (hi)
|
|
* Hungarian (hu)
|
|
* Irish (ga)
|
|
* Kannada (kn)
|
|
* Latin (la)
|
|
* Latvian (lv)
|
|
* Lithuanian (lt)
|
|
* Malayalam (ml)
|
|
* Marathi (mr)
|
|
* Mongolian written in Cyrillic (mn-cyrl)
|
|
* Norwegian Bokmål (nb)
|
|
* Norwegian Nynorsk (nn)
|
|
* Oriya (or)
|
|
* Portuguese (pt)
|
|
* Punjabi (pa)
|
|
* Sanskrit (sa)
|
|
* Slovenian (sl)
|
|
* Spanish (es)
|
|
* Tamil (ta)
|
|
* Telugu (te)
|
|
* Turkmen (tk)
|
|
* Ukrainian (uk)
|
|
* US English (en-US)
|
|
* Welsh (cy)
|
|
|
|
The Ethiopic script pattern files have been renamed from mul-ethi to und-ethi,
|
|
to properly follow BCP 47 naming.
|
|
|
|
In case hyphenation patterns are licensed under dual licenses,
|
|
hyph-{lang}.lic.txt files are edited to keep only the license Android applies
|
|
as listed in the corresponding NOTICE files.
|
|
|
|
For the German (de) patterns, the *.chr.txt has been automatically generated
|
|
based on the contents of the pattern files, using the the create_chr.py script
|
|
created by Android engineers.
|
|
|
|
British English (en-GB) patterns are taken from a different part of the
|
|
tex-hyphen repository. Please see the README.android file in the 'en-GB'
|
|
directory.
|
|
|
|
US English (en-US) patterns have been modified to remove patterns added to
|
|
the end of the file that had been incorrectly computed based on the TUGboat
|
|
hyphenation exception list. To compensate for that, the TUGboat hyphenation
|
|
exception list for US English has been downloaded from
|
|
<http://mirror.ctan.org/info/digests/tugboat/hyphenex/ushyphex.tex> and
|
|
explicitly added to the end of the hyphenation exception file for US
|
|
English.
|
|
|
|
Swedish (sv) patterns are converted to utf-8 and the first three
|
|
lines(encoding, leftmin, rightmin) are removed to preserve the hyphenation
|
|
pattern only in *.pat.txt
|