pe-reconstruct

This repository contains a demo of a project to automate the restoration/expansion of the long elision (the anupubbasikkhā peyyāla) common to 11 of the 13 suttas in the Sīlakkhandhavagga of the Digha Nikāya of the Suttapiṭaka of the Pāli Tipiṭaka (DN 3 through DN 13). Since automating this requires some text substitutions that are specific to particular texts, this demo is constructed for Ven. @Sujato's English translations.

Resources

The resources needed to thus reconstruct the 11 suttas DN 3 through DN 13 are (in addition to the bilara-data JSON files themselves for each cognate of the sutta) a single script and three simple JSON files for each sutta, provided as part of this demo.

./restore-sutta.raku
./map/dn*_pe-map.json
./map/en/dn*_pe-subst-en-sujato.json
./map/pli/dn*_pe-subst-pli-ms.json

restore-sutta.raku

The script is written in Raku, but I plan to reimplement in Python if there is interest in this. The only dependency is on a standard JSON-parsing module: more information on this below. See below also for command-line arguments that the script will accept.

The pe-map.json files

For each sutta, the pe-map.json segment-mapping file is universal to all six cognates and should work without modification for all translations that are aligned with SuttaCentral's segment numbering.

The pe-map.json file contains "virtual" segment IDs for the elided segments of the target sutta, associating each one with the corresponding segment of DN 2 (the source from which the peyyala text is restored). Occasionally an elided segment is restored with the text of a repeated segment from elsewhere within the same sutta, not from DN2.

The only time this file would need to be updated is when the segmentation changes, specifically the segment numbering.

The pe-subst.json files

There are some textual substitutions in the DN 2 text needed for the English translation, and a separate set needed for full support of interlinear & side-by-side display of root text. These are provided for by two pe-subst.json files for each sutta.

The vast majority of these changes are subsection numbering and interlocutor vocatives, so while the pe-subst-en-sujato.json files may need updates more often than pe-map, they should be quite stable once completed.

Support for interlinear or side-by-side display of the Pāli root text is partially implemented as of May 2025. Please look over the reconstructions of the root texts provided for this purpose and let me know what I've missed.

How to run the demo

If you can't run the script but want to inspect the results, the directory

./demo

contains a pre-generated set of files with the long peyyala expanded.

The way I test this on my system is

download a clone of the bilara-data repository, and then
in the same directory, clone the pe-reconstruct repository
bilara-data and pe-reconstruct must both be in the same directory
cd into the pe-reconstruct directory and run the script
the results will be in pe-reconstruct/reconst

Or to spell it out:

git clone https://github.com/suttacentral/bilara-data.git
git clone https://github.com/arkiuat/pe-reconstruct.git
cd pe-reconstruct
./restore-sutta.raku
ls -l reconst

The script uses relative pathnames via ../bilara-data to read the original elided versions of the cognate files, which is why pe-reconstruct and bilara-data must be sister directories, that is, side-by-side in the same parent directory.

Command-line parameters

If no command-line arguments are supplied, the script reconstructs expanded versions of all six cognates of all eleven suttas.

If you want to generate the reconstructed cognate files for just one sutta, specify its number on the command line, e. g.:

./restore-sutta 12

will only reconstruct (with peyyala expanded) the files for Lohiccasutta.

If you have made a minor change somewhere or for any other reason want to regenerate only a single cognate of a single sutta, you can specify both the sutta-number and the cognate tag on the command line:

./restore-sutta 12 translation

will generate only the expanded translation.json cognate file for Lohiccasutta. Adding this feature was perhaps overkill, because the script already won't overwrite a cognate file that is identical to the one it just generated, so if you run it without args in this situation, it will generate all the cognate files and check them, but it will only write the ones that have actual changes from their sources.

How to install Raku, JSON::Tiny, and zef

Of course, you won't be able to run the script in step 3 if you don't have Rakudo installed, and also the small JSON-parsing module used. (Raku is the programming language, Rakudo is the current main implementation.)

Directions for this may be found at the following URLs:

You may want to install zef, the standard Raku module installer, before installing JSON::Tiny.

https://raku.land/zef:ugexe/zef

Additional information

https://docs.raku.org/ is the official Raku documentation page.
https://rakudo.org/ has other information about Rakudo.
https://raku.land/ is a directory of Raku distributions such as modules, etc.
https://raku.org/ is a one-stop-shop that should link to most of the above.

Additional background on this project

Some sheets developed to assist with the process of developing the *pe-map.json files.

https://docs.google.com/spreadsheets/d/1WxgF3UnEtbZBj_0ikXYsbquO0h8XfxRBrVWuaWUQYh8/

I have documented the procedure I follow to put together pe-map.json and pe-subst.json files, and may publish that here at some point, perhaps after I try applying it to a few Majjhima Nikāya suttas, which will probably shake some bugs out of the process.

To Do

Updates to the .JSON files

So far I'm the only one making decisions about how the pe-map.json files are constructed, and for some small bits I can see that there is a case to be made for more than one way to go about things. I hope that after publishing this demo, the community can rapidly attain a consensus on these questions. I plan to update the map files to accord with this consensus as it develops.
The pe-subst-en-sujato.json files will need to track changes as Ven. @Sujato revises the published translations. I expect that a very small percentage of future revisions will require any updates to the pe-subst-en-sujato.json files.
If there is interest, I may add pe-subst-*.json files for other segment-aligned translations. If I get collaborative support, we may do this for segment-aligned translations into other languages.
Because proper support for interlinear and side-by-side display is the newest feature added before initial publication, I have not yet finished constructing all of the pe-subst-pli-ms.json files correctly. Originally I hadn't intended to touch the Pāli root text, but once things got to a certain point, it seemed a shame not to support interlinear and side-by-side display. These will be updated.

Updates to the script

Right now the script is highly specific, with both DN (as domain) and DN2 (as text-source) hard-coded in. In the next phase I hope to extend this so that the same script will also handle elision-restorations in Majjhima Nikāya suttas as well, with different suttas used as source for the material to be restored, so in that iteration of the project, neither DN nor DN2 will be hard-coded in anymore.

Other updates

Of course, no peyyala can be restored by this system without hand-building a pe-map.json file for each individual sutta, and for the time being, I'm the only one doing that kind of thing. To address this, once I've improved the production process as much as I can, I plan to document it and post it here.

Current TODOs

rewrite this README to address TLDR (say same in fewer words)
finish producing subst-pli-ms.json files for DN6 & DN7
double-check accuracy & completeness of the subst-pli-ms.jsons (HELP NEEDED)
finish the write-up of issues encountered that Ven. @snowbird expressed interest in
finish documenting production process for pe-{map,subst}.json files; do 5th pass
consider rewriting restore-sutta script in Python. What JSON module do SC's python scripts use?
begin identifying longer peyyālas in Majjhima Nikāya's first 50

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
demo		demo
map		map
reconst		reconst
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
restore-sutta.raku		restore-sutta.raku

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pe-reconstruct

Resources

restore-sutta.raku

The pe-map.json files

The pe-subst.json files

How to run the demo

Command-line parameters

How to install Raku, JSON::Tiny, and zef

Additional information

Additional background on this project

To Do

Updates to the .JSON files

Updates to the script

Other updates

Current TODOs

About

Uh oh!

Releases

Packages

Languages

License

arkiuat/pe-reconstruct

Folders and files

Latest commit

History

Repository files navigation

pe-reconstruct

Resources

restore-sutta.raku

The pe-map.json files

The pe-subst.json files

How to run the demo

Command-line parameters

How to install Raku, JSON::Tiny, and zef

Additional information

Additional background on this project

To Do

Updates to the .JSON files

Updates to the script

Other updates

Current TODOs

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages