erc-dharma / tfc-campa-epigraphy Goto Github PK

DHARMA project Task Force C Campa epigraphic corpus. Some files have be reused from the Corpus of the Inscriptions of Campā.

Home Page: https://dharma.hypotheses.org/

License: Creative Commons Attribution 4.0 International

Rich Text Format 100.00%

epigraphy epidoc dharma tfc-campa-epigraphy inscriptions campa xml-tei

tfc-campa-epigraphy's Introduction

tfc-campa-epigraphy

DHARMA project Task Force C, Campa epigraphic corpus. Some files have be reused from the Corpus of the Inscriptions of Campā. For more information, see the website description.

Repository Static page

You can consult the current state of the work on this repository at https://erc-dharma.github.io/tfc-campa-epigraphy/.

Help and Guides

A Guide for Git is provided in the project documentation repository at DHARMA_Git_guide_v01.md
DHARMA encoding guide is available under encoding-diplomatic, but we recommend you to use the google doc version to access the latest edits.
Encoding templates are also provided under templates
Feel free to use issues to ask questions to other members of the project. To Get started with issue, see the guide.

Validation

All DHARMA XML are validated against Epidoc schema in its latest version as well as against DHARMA ones. You can declare those schemas in the processing instructions of every XML files. To use it with Oxygen, you might need to change the default configuration. In the toolbar, go to the Options > Preferences, a window will open, select XML > XML Parser > RelaxNG. The check box for "Add default attributes values" must be unchecked. Click on Apply. You might need to restart Oxygen for it to be taken into account.

DHARMA validation schemas
- a RelaxNG with embedded Schematron code is available here.
- a Schematron with embedded Schematron Quick Fixes to complete the first schema is available here

Note that both schemas are needed to a complete validation of DHARMA encoding rules

If you use an online validation processing, the processing instructions should be as followed

<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_Schema.rng" type="application/xml" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_Schema.rng" type="application/xml" schematypens="http://purl.oclc.org/dsdl/schematron"?>
<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_SQF.sch" type="application/xml" schematypens="http://purl.oclc.org/dsdl/schematron"?>
<?xml-model href="http://www.stoa.org/epidoc/schema/latest/tei-epidoc.rng" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="http://www.stoa.org/epidoc/schema/latest/tei-epidoc.rng" schematypens="http://purl.oclc.org/dsdl/schematron"?>

However, if you need to do it locally, you can access all DHARMA RelexNG and Schematron in their latest version in the projection-documentation repository under schema/latest. In this case, you will need to update the @href and provide the path between you current file and the schema itself (either as a standalone file or either as a part of the project-documentation repository if you have cloned it)

Workflow

Before working with this repository, make sure you always the latest version through a git pull
Create a XML file or edit an existing one.
Once, you are done use git to add, commit and push it on the repository.
Once you have done your git push, the XML files are validated against the Epidoc and Dharma RelaxNG with a CI service called Travis. This process is made possible with the existing .travis.yml file. Do not delete or modify this file on your own. You can access the resulting log here
On the git push, we are also using github Actions service to transforme automatically the XML into a HTML version. We are using Ant in a java environment to do this batch transformation. The steps are declare in the build.xml file, while the automated scenario is provided under .github/workflows/editorial.yml Do not delete or modify those files on your own.
- Step 1: it will editorially edit all the XMLs in the repository.
- Step 2: from those edited XMLs, it will create a HTML output and retrieve Zotero data as well.
- Step 3: it will push those files in the repository under editedxml and htmloutput folders.
- Step 4: to access those newly created files, make a git pull
- Step 5: all the HTML are available as a static website thanks to GitHub pages service. Note that the README is the by default page if no XMLs are stored in the repository: https://erc-dharma.github.io/tfc-campa-epigraphy/
Please note that this process can break easily and requires high quality data. The content log of the pipeline are available under Actions tab of the repository, if you need to find the reason behind an error.

License & Attribution

All the edited XML files of this repository are available under the Creative Commons CC-BY 4.0 License, meaning you are free to use it for any purpose, commercial or non-commercial! However, we ask you to mention the project name, funder as well as the members involved in the XML edition of the inscriptions. Feel free to contact the project for more infos at [email protected].

Contributing to this repository

Thanks for your interest in contributing! Currently only members of the project are allowed to contribute to this repository. Contact the researchers in charge of this corpus if you are interested in working with them.

tfc-campa-epigraphy's People

Contributors

Stargazers

Watchers

Forkers

paregorios

tfc-campa-epigraphy's Issues

restrict access to rtf files?

Je viens de déplacer un lot de fichiers rtf depuis le repo Campa-Epigraphy.

Est-il possible de limiter la visibilité/accessibilité de ce dossier aux seuls membres du projet?

réviser C. 142

Chère @salomepichon — Je viens de faire quelques modifications dans le fichier, séparant une phrase en sanskrit que tu avais incluse dans un paragraphe labélisé xml:lang="ocm-Latn" et répondant à de nombreuses remarques datant je pense de 2021. J'ignore si tu as depuis relu l'inscription avec les orthophotos générées par Adeline. Si ce n'est pas encore fait, ce sera certainement utile. À l'époque, tu avais encore une approche très "je vois ça sur la pierre" insuffisamment modérée par une conscience de ce qui est probable. Et tu n'osais ni proposer des apports au fichier Symbol Taxonomy ni poser des questions sur l'encodage à Daniel.

Il faut donc bien mettre à jour ce fichier.

données sur ton travail dans teiHeader

@salomepichon —

je me mets à résoudre les erreurs de validation dans DHARMA_INSCIC00001.xml et trouve ces lignes dans le teiHeader:

	<respStmt>
					<resp>EpiDoc Encoding</resp>
					<persName ref="part:sapi">
						<forename>Salomé</forename>
						<surname>Pichon</surname>
					</persName>
				</respStmt>
				<respStmt>
					<resp>intellectual authorship of edition</resp>
					<persName ref="part:argr">
						<forename>Arlo</forename>
						<surname>Griffths</surname>
					</persName>
					<persName ref="part:sapi">
						<forename>Salomé</forename>
						<surname>Pichon</surname>
					</persName>
				</respStmt>
				<respStmt>
					<resp>Conversion of encoding for DHARMA</resp>
					<persName ref="part:sapi">
						<forename>Salomé</forename>
						<surname>Pichon</surname>
					</persName>
				</respStmt>

Personnellement, j'ai l'impression que tu n'as jamais touché à ce fichier (il n'y a pas de ligne <change> te concernant), que tu n'as pas d'autorité intellectuelle sur l'édition (car j'ai édité moi-même C. 1 avant que tu deviennes mon étudiante), et du coup qu'Axelle a dû insérer ces infos dans tous les fichiers qu'elle a convertis depuis l'ex-répositoire du Corpus des inscriptions du Campā.

Le confirmes-tu?

Je devrai alors regarder au cas par cas si ces informations te concernant sont correctes ou non.

@n on <ref> with @target="DHARMA_INS....xml"

Je trouve des élément du genre <ref n="tfc-campa-epigraphy" target="DHARMA_INSCIC00096.xml">C. 96</ref> dans les fichiers convertis. Que vient faire ici le @n?

replace language/script codes

Il semble que les fichiers convertis du corpus Campa permettraient un remplacement global des codes @xml:lang="sa-Latn-x-CI" pour donner tant @xml:lang="san-Latn" que @rendition="class:83233 maturity:83213" — peux-tu le mettre en oeuvre, @ajaniak?

pourrais-tu en même temps insérer dans tous les éléments qui contiennent actuellement @xml:lang="san-Latn" ou "@xml:lang="ocm-Latn" un attribut supplémentaire que nous allons remplir, @salomepichon et moi: @rendition="class:83233 maturity:00000"?

merci!

Arlo

DHARMA_INSCIC00072.xml ready for conversion to DHARMA norms

@ajaniak: Dans ton email à Salomé de l'autre jour, tu as signalé que "DHARMA_INSCIC00072.xml n'a pas été modifié en raison de son contenu majoritairement en latex". J'ai maintenant converti toutes les balises LaTeX en TEI. Peux-tu voir si du coup le fichier peut être converti? Merci.

display of <milestone>

@ajaniak : it seems that Salomé has correctly encoded the milestones in DHARMA_INSCIC00142 but the display is not consistent. Do you understand why? In the case of faces C and D, I wonder if the asymmetrical display with a small [ and a big ] is intentional.

Champa geodata

@salomepichon : faire un git pull. Tu peux voir ici précisément ce que j'ai changé dans le fichier geo suite au mail de hier de Ky sur le lieu de découverte de C. 157. J'ai

corrigé l'orthographe de Cẩm Vân en Cẩm Văn
déplacé l'entrée déjà existante pour Cẩm Văn en le casant sous le district de Điện Bàn
enveloppé l'entrée dans un niveau intermédiaire (village de Điện Hồng)
cité les infos de Ky dans une <note>

Tu remarqueras que je n'ai pas créé un hameau correspondant aux noms cités par Ky car nous ne savons pas où dans l'ancien village de Cẩm Văn, avant sa scission en trois hameaux, l'inscription a été trouvée.

@michaelnmmeyer : pour info. Un ancêtre de ce fichie geo-cic-dharma.xml avait été créé pour moi par Tom Elliott en 2009. Notre idée, avec Adeline, est de continuer à l'alimenter et améliorer encore un temps, puis d'en extraire les données pour créer des lieux avec identifiants dans PLEIADES. Ces identifiants seront ensuite liés aux inscriptions dans nos métadonnées. Tu nous le diras si tu as quelque choses à dire sur notre démarche.

@alevivier : pour info.

language codes

Dear @michaelnmmeyer —

Could you please replace all cases of xml:lang="x-oldcam-Latn" in tfc-campa-epigraphy by xml:lang="ocm-Latn"?

Thanks!

Arlo

spelling normalizations in translation and lexicon

il faut bien tenir compte des remarques dans mon articles (version 20240214):

p. 34
It is important to note that the marker transliterated as m̃ has nothing to do with /m/, but rather appears in the following four functions: [...] 4. to express a schwa vowel in the syllable nucleus (gam̃p = gəp).

p. 35
In periods and areas where the m̃ is not yet in use, or is used only sporadically, the functional scope of the marker ṁ (the Indic anusvāra) covers that of the m̃ described above.

La remarque p. 35 s'applique à tout ou partie du corpus Indrapura.

Du coup, dans ta traduction et dans tes apparts lexicographiques fondés sur C. 106, je propose les modifications suivantes :

Ricaṁm >> Ricəm
Avaṁk >> Avək

Enfin, coquille de ta part: Ñjun >> Ñjuṅ

Tu revérifieras tes autres traductions ainsi que le vocabulaire que tu en as extrait dans cam-lexicon?

C. 148

il semble qu'aucune valeur @n n'ait été déclaré sur sur le <bibl> pour Majumdar 1927
peux-tu copier/coller l'édition https://dharmalekha.info/texts/DHARMA_INSCIC00148 dans le fichier txt sur electronic-texts? il faut juste t'assurer qu'il n'y a pas actuellement de remarques utiles dans ce fichier txt qu'on perdrait
la traduction reste à terminer

<p rend="stanza"> in translation

@michaelnmmeyer : do you understand why the schema rejects my use of <p rend="stanza"> in the file DHARMA_INSCIC00137.xml? I am consulting @danbalogh about the other validation problem in project-documentation issue #301.

replace ṛ by r̥

@ajaniak : could you implement this replacement in all
tfc-campa-epigraphy xml files? thanks!

code erronée dans un html?

Cher Axelle,

L'autre jour en faisant un git pull j'ai reçu un html modifié, sans pour autant que le xml source semble avoir été modifié récemment:

Quand je regarde le dernier commit généré par le workflow, je vois qu'une modification a affecté DHARMA_INSCIC00072.html mais je ne sais pas comment le modification a été générée. Aussi, je vois qu'il y a des éléments bibliographiques qui viennent interrompre le paragraphe et que l'interruption semble être devenue plus grave après ce commit.

Pourrais-tu voir ce qui se passe?

Merci!

Arlo

values of @met

Je vous dans les fichiers convertis que les valeurs de @met commencent avec majuscules. selon EGD, App. B, les valeurs doivent toujours commencer avec minuscule.

@ajaniak : peux-tu appliquer un remplacer global des majuscule par des minuscules? merci!

erc-dharma / tfc-campa-epigraphy Goto Github PK

tfc-campa-epigraphy's Introduction

tfc-campa-epigraphy

Repository Static page

Help and Guides

Validation

Workflow

License & Attribution

Contributing to this repository

tfc-campa-epigraphy's People

Contributors

Stargazers

Watchers

Forkers

tfc-campa-epigraphy's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs