batak's People

Contributors

simoncozens

batak's Issues

Major flaw with Batak Noto font

Defect Report

Title

Major flaw

Font

Noto Sans Batak-Regular

Where the font came from, and when

Font Version

OS name and version

Application name and version

Issue

I am Dr. Uli Kozok, a specialist in Batak philology and palaeography at the University of Hawaii. I developed the first Batak font and then, with Michael Everson, the first Unicode font. Michael's Pangururan font has its own flaws, but Noto has one major flaw, the most profound of all, in addition to a whole range of minor flaws that I will address later. Daan van Esch suggested that I post it here. Thank you!

If I want to write the Batak name Simbolon, the Batak Noto font allows me to write the sequence sim\bolon\, where \ represents the virama. However, this is not how the Batak script works. In the Batak script, the vowel in a closed syllable (where the vowel is not a) must be written after the two consonants and before the virama. Why is this so? Nobody knows. No other Indonesian script has this odd rule, but that is how it is. Therefore the sequence sim\bolon\ (for Simbolon) is wrong. The correct way to write it is smi\bolno. This correct form is "penalized" by a little dotted circle that marks it as "wrong". (see screenshot sorry, don't know how to )

Character data

Please include real character data to illustrate your issue-- Unicode codepoints are helpful. This makes it possible for developers who don't know the language or script to copy/paste the text to reproduce the issue.
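One way to produce such a code point listing is a few lines of Python. This is only a sketch using the standard `unicodedata` module; the helper name `describe` is made up for illustration, and the sample string is U+1BE4 followed by U+1BFF.

```python
import unicodedata


def describe(text: str) -> list[str]:
    """Return one 'U+XXXX NAME' line per code point in text."""
    return [
        # Fall back to a placeholder if the local Unicode database lacks a name.
        f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}"
        for ch in text
    ]


if __name__ == "__main__":
    # BATAK LETTER I followed by BATAK SYMBOL BINDU PANGOLAT
    for line in describe("\u1BE4\u1BFF"):
        print(line)
```

The output can be pasted under "Character data" next to the literal text.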

Screenshot

If possible, include a screenshot or an image illustrating the issue.
Annotations are also helpful.

Tools for reporting bugs

Useful tools for reporting bugs are available at: https://github.com/googlei18n/

Harfbuzz hb-view and hb-shape

These are part of the HarfBuzz distribution and can help isolate if an issue is in the app/OS, shaping engine, or font.

  • hb-view renders the text with the exact font (for example, to see how ligatured characters shape) using your installed version of HarfBuzz.

For example:

  hb-view --font-file {path to font} --text-file {path to text file} --output-file '{sample}.png'

  • hb-shape shows glyph selection and positioning

Fontview

  • Fontview displays the text.

Fontdiff

  • Fontdiff displays the text using two versions of the font side by side.

Poor positioning of U+1BFF BATAK SYMBOL BINDU PANGOLAT

Font

NotoSansBatak-Regular.ttf

Where the font came from, and when

Site: https://noto-website-2.storage.googleapis.com/pkgs/NotoSansBatak-unhinted.zip
Date: 2018-06-28

Font version

Version 2.000;GOOG;noto-source:20170915:90ef993387c0

Issue

U+1BFF BATAK SYMBOL BINDU PANGOLAT should curve around the preceding character. In this font, it is too high and does not interact with the preceding character. See L2/08-011R and Tobasche spraakkunst, pg. 22 for examples.

Character data

ᯤ᯿
U+1BE4 BATAK LETTER I
U+1BFF BATAK SYMBOL BINDU PANGOLAT

Screenshot

ᯤ᯿

The result of the first Toba Congress request to modify the "ba" character.

Is your feature request related to a problem? Please describe.
News: https://www.ipminews.co.id/rumusan-hari-ketiga-kongres-i-kebudayaan-batak-toba-pada-22-oktober-2022/
Archive: http://web.archive.org/web/20230714062818/https://www.ipminews.co.id/rumusan-hari-ketiga-kongres-i-kebudayaan-batak-toba-pada-22-oktober-2022/
Quote: "bentuk induk surat 'ba' perlu dipertimbangkan untuk menjadi bulat lonjong 'ba', karena ini yang lebih banyak di naskah"
Translation: "the [letter] 'ba' needs to be considered to be rounded oval 'ba', because this is the one that is more common in the text[s]"

One of the resolutions of the congress in October 2022 was that the shape of "ba" should be changed to a more elongated form, as commonly seen in Toba manuscripts, for example the "Pustaha laklak".
image

This request is on behalf of Surung Simanullang.

Describe the solution you'd like
1BC5 ᯅ BATAK LETTER BA should be modified to become more elongated (see the example in the image above).
Either the glyph is redrawn or an alternative glyph is provided.

Describe alternatives you've considered
Keeping the current form ᯅ, which is considered the less common (rare) form based on the frequency of the 'ba' forms found in the Toba manuscripts.

Additional context

image
Add MSS 156789 p 7: in "balang do" (ᯅᯞᯰ ᯑᯬ), the letter "ba" (ᯅ) is more rounded, as in other manuscripts.

Does not build

Error: In '../../../../../../tmp/tmped85hbsb/NotoSansBatak-Regular.ufo': Compiling UFO failed: /tmp/tmped85hbsb/NotoSansBatak-Regular.ufo/features.fea:15:1: If "languagesystem DFLT dflt" is present, it must be the first of the languagesystem statements
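The failing features.fea is not shown in this report, so the following is only a sketch of how the ordering rule in the error message could be checked before compiling. The function names and the sample text are hypothetical; `batk` is the OpenType script tag for Batak.

```python
import re

# Matches statements such as "languagesystem DFLT dflt;" at the start of a line.
LANGSYS_RE = re.compile(r"^\s*languagesystem\s+(\w+)\s+(\w+)\s*;", re.MULTILINE)
DFLT = ("DFLT", "dflt")


def languagesystems(fea_text: str) -> list[tuple[str, str]]:
    """Return (script, language) pairs in the order they are declared."""
    return LANGSYS_RE.findall(fea_text)


def dflt_is_first(fea_text: str) -> bool:
    """True if DFLT/dflt is absent or is the first languagesystem statement."""
    systems = languagesystems(fea_text)
    return DFLT not in systems or systems[0] == DFLT


def reorder(systems: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Move DFLT/dflt to the front, keeping the other entries in order."""
    if DFLT not in systems:
        return list(systems)
    return [DFLT] + [s for s in systems if s != DFLT]


if __name__ == "__main__":
    # Hypothetical input that would trigger the error quoted above.
    sample = "languagesystem batk dflt;\nlanguagesystem DFLT dflt;\n"
    print(dflt_is_first(sample))
    print(reorder(languagesystems(sample)))
```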

U+1BFF BATAK SYMBOL BINDU PANGOLAT overlaps marks and 'dist' should ignore U+1BE6 BATAK SIGN TOMPI

Font

NotoSansBatak-Regular.otf

Where the font came from, and when

Site: https://github.com/notofonts/batak/releases/tag/NotoSansBatak-v2.003
Date: 2023-10-06

Font version

Version 2.003

Issue

U+1BFF BATAK SYMBOL BINDU PANGOLAT can overlap the marks U+1BE8 and U+1BED in the preceding cluster. The 'dist' lookup fixes some overlaps but not all of them, and it is stymied by U+1BE6 BATAK SIGN TOMPI, which should be ignored in 'dist'. The examples given below are not exhaustive: this is a general problem.

Character data

ᯑᯭ᯿ᯂ᯦ᯨ᯿
U+1BD1 BATAK LETTER DA
U+1BED BATAK VOWEL SIGN KARO O
U+1BFF BATAK SYMBOL BINDU PANGOLAT
U+1BC2 BATAK LETTER HA
U+1BE6 BATAK SIGN TOMPI
U+1BE8 BATAK VOWEL SIGN PAKPAK E
U+1BFF BATAK SYMBOL BINDU PANGOLAT
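Since the examples are not exhaustive, a broader set of test strings can be generated mechanically. The sketch below only combines the bases and marks named in this report, with and without U+1BE6 BATAK SIGN TOMPI. It does not check whether each combination is linguistically meaningful, and the function name is hypothetical.

```python
from itertools import product

BINDU_PANGOLAT = "\u1BFF"
TOMPI = "\u1BE6"
BASES = ["\u1BD1", "\u1BC2"]  # BATAK LETTER DA, BATAK LETTER HA
MARKS = ["\u1BE8", "\u1BED"]  # VOWEL SIGN PAKPAK E, VOWEL SIGN KARO O


def bindu_test_strings(bases=BASES, marks=MARKS) -> list[str]:
    """Build base (+ tompi) + mark + bindu pangolat for every combination."""
    return [
        base + tompi + mark + BINDU_PANGOLAT
        for base, tompi, mark in product(bases, ["", TOMPI], marks)
    ]


if __name__ == "__main__":
    # Print one string per line, ready for hb-view's --text-file option.
    for s in bindu_test_strings():
        print(s)
```

Extending `BASES` and `MARKS` to the full Batak block would give a fuller regression text for the 'dist' lookup.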

Screenshot

ᯑᯭ᯿ᯂ᯦ᯨ᯿

Noto Sans Batak doesn't reorder as required by Unicode Standard

The Unicode Standard, section 17.6, specifies a reordering of glyphs in certain situations:

"The main peculiarity of Batak rendering concerns the reordering of the glyphs for vowel signs when one of the two killers, pangolat or panongonan, is used to close the syllable by killing the inherent vowel of a final consonant. This reordering for display is entirely regular. So, while the representation of the syllable /tip/ is done in logical order: <ta, vowel sign i, pa, pangolat>, when rendered for display the glyph for the vowel sign is visually applied to the final consonant, pa, rather than to the ta. The glyph for the pangolat always stays at the end of the syllable."

See
http://www.unicode.org/versions/Unicode8.0.0/ch17.pdf
and for examples
https://en.wikipedia.org/wiki/Batak_alphabet#Diacritic_reordering_for_closed_syllables

The Noto Sans Batak font does not implement this reordering. Note that this is not the kind of reordering that the Universal Shaping Engine takes care of; it must be implemented in the font.
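The expected visual order can be illustrated with a small Python sketch. It works on code points rather than glyphs, handles only the simple pattern <vowel sign, consonant letter, killer>, and ignores tompi and the consonant signs. It is meant to show the rule quoted above, not how a font would implement it in its substitution lookups. The function names are made up for illustration.

```python
import unicodedata

# U+1BF2 BATAK PANGOLAT and U+1BF3 BATAK PANONGONAN
KILLERS = {"\u1BF2", "\u1BF3"}


def is_vowel_sign(ch: str) -> bool:
    return unicodedata.name(ch, "").startswith("BATAK VOWEL SIGN")


def is_letter(ch: str) -> bool:
    return unicodedata.name(ch, "").startswith("BATAK LETTER")


def visual_order(logical: str) -> str:
    """Move a vowel sign after the final consonant of a closed syllable."""
    chars = list(logical)
    for i in range(2, len(chars)):
        if (
            chars[i] in KILLERS
            and is_letter(chars[i - 1])
            and is_vowel_sign(chars[i - 2])
        ):
            # <vowel, consonant, killer> becomes <consonant, vowel, killer>
            chars[i - 2], chars[i - 1] = chars[i - 1], chars[i - 2]
    return "".join(chars)


if __name__ == "__main__":
    # /tip/ in logical order: ta, vowel sign i, pa, pangolat
    tip = "\u1BD6\u1BEA\u1BC7\u1BF2"
    print([f"U+{ord(c):04X}" for c in visual_order(tip)])
```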

To make matters worse, if the user tries to compensate for this bug by manually entering characters in the desired visual sequence, which is logically invalid because it has a virama directly following a vowel, the Universal Shaping Engine does not flag this sequence as invalid because the Batak viramas are classified as "pure killers".

Steps to Reproduce:

  1. Open Firefox.
  2. Go to http://lindenbergsoftware.com/google/noto/batak-reordering.html

Expected Results:
In the section "Vowel reordering", the strings should be rendered with the following glyph sequences:
ᯖ ᯇ ᯪ ᯲
ᯖ ᯇ ᯮ ᯲

Actual Results:
In the section "Vowel reordering", the strings are rendered with the following glyph sequences:
ᯖ ᯪ ᯇ ᯲
ᯖ ᯮ ᯇ ᯲

See screen shot.

Configuration:
Noto Sans Batak Regular 1.05. Firefox 44.0.1 on Mac OS X 10.11.3.

batak reordering

No ligatures with U+1BEE BATAK VOWEL SIGN U

Font

NotoSansBatak-Regular.ttf

Where the font came from, and when

Site: https://noto-website-2.storage.googleapis.com/pkgs/NotoSansBatak-unhinted.zip
Date: 2018-06-28

Font version

Version 2.000;GOOG;noto-source:20170915:90ef993387c0

Issue

The font has no ligatures with the vowel sign u.

Character data

ᯀᯮᯁᯮᯂᯮ
U+1BC0 BATAK LETTER A
U+1BEE BATAK VOWEL SIGN U
U+1BC1 BATAK LETTER SIMALUNGUN A
U+1BEE BATAK VOWEL SIGN U
U+1BC2 BATAK LETTER HA
U+1BEE BATAK VOWEL SIGN U

Screenshot

ᯀᯮᯁᯮᯂᯮ

The Mandailing 'na' letter is not correct

Defect Report

Noto fonts can be obtained from https://notofonts.github.io/
If you find a problem with a released font, then please use this template to file a defect report.
Before filing your report, please check if the issue has been fixed in the latest development build. You can get development builds by finding the font family on https://notofonts.github.io/ and following the link to "Development builds".

Title

The Mandailing 'na' letter is not correct

Writing this bug report on behalf of Askolani and Ikbal (https://btm.wiktionary.org/wiki/User:IHLubis), both of whom are recipients of GLAM Indonesia's digitization grants. Askolani doesn't have a wiki account, but you can take a look at their CVs at https://meta.wikimedia.org/wiki/Wikimedia_Indonesia/GLAM_Indonesia/Hibah_Mini_2023/Digitalisasi_Manuskrip_Mandailing (in Indonesian)

Font

NotoSansBatak-Regular.ttf

NotoSansBatak-Regular.zip

Where the font came from, and when

I believe from https://fonts.google.com/noto/specimen/Noto+Sans+Batak, that's where I usually download Noto fonts from. But I'm not 100% sure.

Date: 2021-04-01 (according to the file date on my folder)

Font Version

  • Win -- 3.1, August 2, 2020

Issue

  1. Type ᯊ
  2. Observed results

image

  3. Expected results

image
image

  4. Additional information
    See the recently uploaded Mandailing manuscripts (Pustaha): https://commons.wikimedia.org/wiki/Category:WikiPustaha_Mandailing

  5. Furthermore, Askolani and Ikbal shared a document with me showing all the differences between what's being used by the Mandailing community and what's being used in Noto Fonts
    Daftar tabel huruf Aksara (tulak tulak).docx

Character data

1BCA ᯊ BATAK LETTER MANDAILING NA

Incorrect glyph for 'u'

Title

Incorrect glyph for 'u'.
Writing this bug report on behalf of https://incubator.wikimedia.org/wiki/User_talk:Surung_Simanullang

Font

Full file name, for example 'NotoSansBatak-Regular.ttf'.
You can upload the problem font here unless it is a Chinese, Japanese or Korean font (these are large).
NotoSansBatak-Regular.zip

Where the font came from, and when

For example:
Site: I believe from https://fonts.google.com/noto/specimen/Noto+Sans+Batak, that's where I usually download Noto fonts from. But I'm not 100% sure.
Date: 2021-04-01 (according to the file date on my folder)

Font Version

  • Win -- 3.1, August 2, 2020

OS name and version

This is especially important if the font came pre-installed.

Application name and version

If the issue is observed using a specific app.

Issue

Summarize the issue briefly -- one paragraph preferred

  1. Write ᯖᯪᯀᯮᯰ ᯉᯪ ᯔᯉᯮᯂ᯲ in Noto Sans Batak (image 1), translit 'tiung ni manuk'
  2. The glyph for ᯀᯮ is incorrect, since it is not displayed as ᯀ (1BC0) + ᯮ (1BEE)
  3. Observed results (see image 1)
  4. Expected results: should look like ᯮ at the bottom right (see image 2)
  5. Additional information

Example from Pustaha Laklak (image 3) Add MS 15678, f. 10r https://www.bl.uk/manuscripts/Viewer.aspx?ref=add_ms_15678_f001r

Character data

  • ᯀ (1BC0) + ᯮ (1BEE) = ᯀᯮ , translit 'u'
  • which, according to the Batak speaker Surung_Simanullang, only occurs as 'ung'
  • ᯀ (1BC0) + ᯮ (1BEE) + ᯰ (1BF0) = ᯀᯮᯰ, translit 'ung'
  • ᯖ (1BD6) + ᯪ (1BEA) + ᯀ (1BC0) + ᯮ (1BEE) + ᯰ (1BF0) = ᯖᯪᯀᯮᯰ in ᯖᯪᯀᯮᯰ ᯉᯪ ᯔᯉᯮᯂ᯲ translit 'tiung ni manuk'

Screenshot

image
Image 1

image
Image 2

image
Image 3

Noto Sans Batak doesn't position as required by Unicode Standard

The Unicode Standard, section 17.6, specifies the positioning of glyphs in certain situations:

"The two consonant signs, U+1BF0 batak consonant sign ng and U+1BF1 batak consonant sign h, are nonspacing marks, usually rendered above the spacing vowel signs. When U+1BF0 batak consonant sign ng occurs together with the nonspacing mark, U+1BE9 batak vowel sign ee, both are rendered above the base consonant, with the glyph for the ee at the top left and the glyph for the ng at the top right."

See http://www.unicode.org/versions/Unicode8.0.0/ch17.pdf

The Noto Sans Batak font does not implement this positioning correctly in all situations.

The expected/actual results are about one case where a glyph is clearly positioned incorrectly. The screen shot highlights some additional cases in blue where the Unicode Standard doesn’t specify the positioning, but which are similar to the situation described in the Standard and should also be rechecked.

Steps to Reproduce:

  1. Open Firefox.
  2. Go to http://lindenbergsoftware.com/google/noto/batak-vowel-consonant.html

Expected Results:
For the syllable with vowel sign ee and final consonant ng, the glyph for the ee should be at the top left and the glyph for the ng at the top right.

Actual Results:
For the syllable with vowel sign ee and final consonant ng, the glyph for the ng is stacked above the glyph for the ee at the top left. The resulting glyph combination is indistinguishable from that for a syllable with sign tompi.

See the red highlight in the screen shot.

Configuration:
Noto Sans Batak Regular 1.05. Firefox 44.0.1 on Mac OS X 10.11.3.

batak vowel positioning

Consonant sign NG (amisara) should be above the letter, not vowel, in "letter+vowel+amisara" combination

Title

Consonant sign NG (amisara) should be above the letter, not vowel, in "letter+vowel+amisara" combination

Writing this bug report on behalf of https://incubator.wikimedia.org/wiki/User_talk:Surung_Simanullang

Font

NotoSansBatak-Regular.ttf

NotoSansBatak-Regular.zip

Where the font came from, and when

I believe from https://fonts.google.com/noto/specimen/Noto+Sans+Batak, that's where I usually download Noto fonts from. But I'm not 100% sure.

Date: 2021-04-01 (according to the file date on my folder)

Font Version

  • Win -- 3.1, August 2, 2020

Issue

In combinations such as 'ing ong ung bing bong bung', etc., where a letter+vowel+amisara combination is formed, the amisara is currently positioned above the vowel, whereas it should be positioned above the letter.

Character data

  • translit: bing bong bung ding dong dung ging gong gung
  • syllables: ᯅᯪᯰ ᯅᯬᯰ ᯅᯮᯰ ᯑᯪᯰ ᯑᯬᯰ ᯑᯮᯰ ᯎᯪᯰ ᯎᯬᯰ ᯎᯮᯰ
  • characters: ᯅ ᯪ ᯰ ᯅ ᯬ ᯰ ᯅ ᯮ ᯰ ᯑ ᯪ ᯰ ᯑ ᯬ ᯰ ᯑ ᯮ ᯰ ᯎ ᯪ ᯰ ᯎ ᯬ ᯰ ᯎ ᯮ ᯰ
    ** consonants: 1BC5 ᯅ BATAK LETTER BA, 1BD1 ᯑ BATAK LETTER DA, 1BCE ᯎ BATAK LETTER GA
    ** 1BEA ᯪ BATAK VOWEL SIGN I = ulua, 1BEC ᯬ BATAK VOWEL SIGN O = siala ulu, 1BEE ᯮ BATAK VOWEL SIGN U = boruta
    ** 1BF0 ᯰ BATAK CONSONANT SIGN NG = amisara
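The nine test syllables above can be rebuilt from the listed code points. The sketch below is a minimal illustration using only the consonants, vowel signs, and amisara given in this report; the function name is hypothetical.

```python
# Code points taken from the character data above.
CONSONANTS = {"b": "\u1BC5", "d": "\u1BD1", "g": "\u1BCE"}  # BA, DA, GA
VOWELS = {"i": "\u1BEA", "o": "\u1BEC", "u": "\u1BEE"}      # I, O, U
AMISARA = "\u1BF0"                                          # CONSONANT SIGN NG


def ng_syllables() -> dict[str, str]:
    """Map each transliteration (e.g. 'bing') to letter + vowel + amisara."""
    return {
        f"{c}{v}ng": letter + vowel + AMISARA
        for c, letter in CONSONANTS.items()
        for v, vowel in VOWELS.items()
    }


if __name__ == "__main__":
    for translit, text in ng_syllables().items():
        print(translit, text)
```

Adding more entries to `CONSONANTS` and `VOWELS` extends the test text to the other combinations shown in image 1.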

Screenshot

image
Image 1: what it should look like: ang eng ing ong ung beng bing bong bung ceng cing cong cung deng ding dong dung geng ging gong gung and so on up to nyung

image
Image 2: what it currently looks like: ᯅᯪᯰ ᯅᯬᯰ ᯅᯮᯰ ᯑᯪᯰ ᯑᯬᯰ ᯑᯮᯰ ᯎᯪᯰ ᯎᯬᯰ ᯎᯮᯰ

image
Image 3: Add MSS 15678, text 2 "ᯅᯮᯀᯖ᯲ ᯘᯖᯰᯖᯪᯰ ᯞᯮᯂᯮᯄ᯦᯲ ᯇᯝᯖᯂᯮᯖ᯲ ᯉᯪ ᯅᯮᯞᯉ᯲" translit "buat satangting luhuk pangatahut ni bulan" shows that in "satangting", the amisara is on top of the consonant letter "ta", not the vowel sign "i".

U+1BFF BATAK SYMBOL BINDU PANGOLAT overlaps marks

Font

NotoSansBatak-Regular.otf

Where the font came from, and when

Site: https://github.com/googlefonts/noto-fonts/blob/ca434f80edeb6a1ecb65f4d40d69691d01f65abd/unhinted/otf/NotoSansBatak/NotoSansBatak-Regular.otf
Date: 2022-02-02

Font version

Version 2.001

Issue

U+1BFF BATAK SYMBOL BINDU PANGOLAT overlaps marks in some preceding clusters.

Character data

ᯆᯭ᯿ᯣᯮ᯿
U+1BC6 BATAK LETTER KARO BA
U+1BED BATAK VOWEL SIGN KARO O
U+1BFF BATAK SYMBOL BINDU PANGOLAT
U+1BE3 BATAK LETTER MBA
U+1BEE BATAK VOWEL SIGN U
U+1BFF BATAK SYMBOL BINDU PANGOLAT

Screenshot

ᯆᯭ᯿ᯣᯮ᯿
