Urdu Layout Requirements (alreq, open)

vermaprashant1 commented on September 1, 2024
Urdu Layout Requirements

from alreq.

Comments (9)

r12a commented on September 1, 2024

@vermaprashant1 thanks for letting me know about this. It sounds interesting. Are you able to point me to an online document where i can review what you currently have? That will help me answer your question about what we can do.

vermaprashant1 commented on September 1, 2024

@r12a it is available at https://tdil-dc.in/undertaking/article/112988Draft_Urdu-Recommendations.pdf for your reference

r12a commented on September 1, 2024

Thanks @vermaprashant1 but i'm getting time outs when trying to reach that file.

vermaprashant1 commented on September 1, 2024

It seems to be working. Please find the PDF attached.

Draft_Urdu-Recommendations.pdf

r12a commented on September 1, 2024

Thanks @vermaprashant1 I read through it a couple of times. As soon as i have a moment i'll add a new comment with questions and suggestions.

r12a commented on September 1, 2024

@vermaprashant1 here is what i'd propose wrt the information in your document. Does this sound like a good plan to you, or are you thinking along different lines?

  1. Create an Urdu gap-analysis document containing the following points:
    1. Generic fallbacks should cause browsers to choose nastaliq fonts by default
    2. First-letter styling should include all joined glyphs (this needs further discussion)
    3. The other essential RTL issues raised for other RTL script gap docs (which doesn't include form issues, since those fall out of the normal RTL rendering in the browser)
  2. Update the Ready-made Counter Styles document with 2 styles per the information in your document – there may be another gap issue here if doubled letters must not join
  3. Create a Layout Requirements doc for Urdu (already in the pipeline)
    1. I would normally do this by porting portions of https://r12a.github.io/scripts/arab/ur.html to a new document on our site, but would include any additional information in your document (with references)

Here are some additional things to consider:

  1. Places in your doc where more information is needed:

    1. Section 3 talks about 'letter-spacing' but gives no information about how that would work. Do you have such information? I have seen one font that elongates some letters, but usually stretching text is much more complicated in the arabic script and is not common in the nastaliq style. I don't usually find that Urdu nastaliq fonts support kashida elongation.
    2. Section 5 says that line-breaking rules must be applied, but doesn't say what they are other than that words should not be broken (which is already the default as far as i know). Is there more that is not covered by the default Unicode line-break rules?
    3. In section 8, could you clarify whether the highlighted text ends at the end of a word or at the first character that doesn't join to the left? I'm guessing word, given the example with the hamza at the end(?). But this seems at odds with the 'Standalone Form', where the highlighted letters seem to be part of the following word. Is the standalone approach different from the 'final unjoined' approach, or are they both simply terminating at the first non-left-joining letter?
    4. In section 9 is the 'Alpha, beta Listing' a fixed counter style? (ie. isn't used after U+063A)
  2. Information in your doc that we probably don't need to repeat in the lreq doc

    1. Sections 1 and 2 are for users
    2. Section 6 should be managed by the font, and i'm not aware of any special difficulties here.
    3. Section 7 is commonsense advice for users.
  3. Comments on your doc:

    1. In section 2 you may want to note that the user-installed fonts you mention can't be used in the Safari (WebKit) browser, because that browser allows use of system fonts only, unless the content author provides a web font.
    2. In section 7 your code should not be using CSS to set the text direction (see https://www.w3.org/International/questions/qa-bidi-css-markup). Urdu text should be surrounded by a p or div element which has a lang attribute and dir="rtl". The code shown is a very bad example.

What do you think?
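
One way to make the section 8 question concrete is to compute where the highlighted run would end under the "first non-left-joining letter" reading. The following is only a rough sketch: the function name `first_joined_unit` and the letter set are assumptions made for illustration, the set is simplified and not exhaustive, and nothing here comes from the draft document itself.

```python
import unicodedata

# Simplified, non-exhaustive set (an assumption for this sketch) of letters
# used in Urdu that do not join to the letter that follows them:
# hamza, alef madda, alef, dal, zal, re, ze, waw, ddal, rre, zhe, yeh barree.
NON_LEFT_JOINING = set(
    "\u0621\u0622\u0627\u062f\u0630\u0631\u0632\u0648\u0688\u0691\u0698\u06d2"
)


def first_joined_unit(word: str) -> str:
    """Return the leading run of a word, up to and including the first
    letter that does not join to the letter after it."""
    for i, ch in enumerate(word):
        if ch in NON_LEFT_JOINING:
            end = i + 1
            # Keep any combining marks attached to that letter.
            while end < len(word) and unicodedata.category(word[end]) == "Mn":
                end += 1
            return word[:end]
    # Every letter joins onward, so the whole word is one joined unit.
    return word


kitab = "\u06a9\u062a\u0627\u0628"  # kaf, teh, alef, beh
print(first_joined_unit(kitab))     # kaf, teh, alef
```

Under the "end of word" reading the function would simply return the whole word, so the two readings differ for any word that contains a non-left-joining letter before its last letter.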

vermaprashant1 commented on September 1, 2024

@r12a Thanks for your response. I will go through your comments in consultation with the respective experts and will respond soon.

r12a commented on September 1, 2024

@vermaprashant1 Here are some more questions about the counter styles information, arising from me drafting some text for the Ready-made Counter Styles doc:

  1. Your document says "In the ordered list, if the characters (defined in CSS ‘symbol’ property) repeat, they should not join." Then there follow some examples. Those examples are of the form 'a a', 'b b', 'c c', etc. This is quite unusual, and problematic for the standard CSS algorithms to produce. Usually the continuation would be 'a a', 'a b', 'a c', and so on. It's not clear to me whether the examples given are intended to indicate a need for the former approach, or whether they are just examples, and the desired approach is the latter (more usual) one.
  2. I thought perhaps we would call one 'urdu-alphabetic' and the other (shorter) one 'urdu-abjad'. Does that sound reasonable? (Compare with the other styles at https://www.w3.org/TR/predefined-counter-styles/#arabic-styles)
  3. By default, the separator after each counter will be an ASCII full stop. Is this appropriate for Urdu counters, or should the separator be something else, such as U+06D4: ARABIC FULL STOP, which is what Urdu uses in normal text?
  4. In order to continue past the end of the initial set of characters for the alphabetic style, the letters in a counter must not join. In order to achieve this, i think we need to add a ZWNJ or a space as part of each counter symbol. Otherwise, i don't know how to achieve the non-joining behaviour without a custom algorithm. My second question is whether there needs to be some space between the parts of the counter, or not (ie. add a space, or add ZWNJ?). The definition would look like:
symbols: '\0627\0020' '\0628\0020' '\067E\0020' etc. // ie. ا ا    ب ب    پ پ

or

symbols: '\0627\200C' '\0628\200C' '\067E\200C' etc. // ie.   ا‌ا    ب‌ب    پ‌پ
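
To make the difference between the two continuations in question 1 concrete, here is a minimal sketch that models two counter systems as described in the CSS Counter Styles specification: `alphabetic` (which continues 'a a', 'a b', 'a c') and `symbolic` (which doubles the same symbol, 'a a', 'b b', 'c c'). It is a simulation for discussion, not browser code. The three-letter symbol list is a shortened stand-in for the real Urdu sequence, and the ZWNJ-suffixed symbols follow the second option above.

```python
import math

ZWNJ = "\u200c"  # ZERO WIDTH NON-JOINER


def alphabetic(value: int, symbols: list[str]) -> str:
    """Model of the 'alphabetic' system: bijective numeration over symbols."""
    if value < 1 or len(symbols) < 2:
        raise ValueError("alphabetic needs value >= 1 and at least 2 symbols")
    n = len(symbols)
    out = ""
    while value > 0:
        value -= 1
        out = symbols[value % n] + out
        value //= n
    return out


def symbolic(value: int, symbols: list[str]) -> str:
    """Model of the 'symbolic' system: the same symbol repeated on each pass."""
    if value < 1 or not symbols:
        raise ValueError("symbolic needs value >= 1 and at least 1 symbol")
    n = len(symbols)
    return symbols[(value - 1) % n] * math.ceil(value / n)


# Shortened, illustrative symbol list: alef, beh, peh, each followed by ZWNJ
# so that repeated letters do not join (an assumption taken from option 2).
LETTERS = ["\u0627", "\u0628", "\u067e"]
SYMBOLS = [letter + ZWNJ for letter in LETTERS]

for counter in range(1, 8):
    print(counter, alphabetic(counter, SYMBOLS), symbolic(counter, SYMBOLS))
```

With this shortened list, counter 5 comes out as alef followed by beh under `alphabetic` and as beh followed by beh under `symbolic`, which is the distinction the question is asking about. Swapping `ZWNJ` for a space models the first option instead.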

r12a commented on September 1, 2024

@vermaprashant1 while we await the information about Urdu, do you know any Kashmiri experts who might be able to help with question #270?
