-
Notifications
You must be signed in to change notification settings - Fork 2
Expand file tree
/
Copy pathindex.html
More file actions
69 lines (56 loc) · 3 KB
/
index.html
File metadata and controls
69 lines (56 loc) · 3 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
---
layout: bootstrap
menu_item: home
---
<div class="well">
<p class="lead">Helping collect the world's language diversity heritage.</p>
</div>
<p>
The Cross-Linguistic Linked Data project is developing and curating interoperable data
publication structures using Linked Data principles as integration mechanism for distributed
resources.
</p>
<p>This philosophy allows for</p>
<ul>
<li>small-scale efforts to publish individual databases like <a href="https://wals.info">WALS</a> (World Atlas of Language Structures) or <a href="http://wold.clld.org">WOLD</a> (World Loanword Database), thereby preserving the brands established by these projects, </li>
<li>while at the same time facilitating a unified user experience across publications.</li>
</ul>
<p>
Within the project, this approach is applied to publishing lexical and grammatical
databases already compiled at the <a href="http://www.eva.mpg.de">MPI-EVA</a> and elsewhere.
This has led to a <a href="https://github.com/clld/clld">software framework</a> which can be
used to develop database journals, i.e. edited collections of databases submitted by linguists from around the
world.
</p>
<p>
A list of databases implemented as <a href="https://github.com/clld/clld">clld</a> applications
and published on the CLLD platform is available following the <a href="datasets.html">Datasets</a> link.
</p>
<p>
<a href="https://dictionaria.clld.org">Dictionaria</a> — a journal of dictionaries of less widely studied languages,
edited by Martin Haspelmath & Barabara Stiebels — which runs on the clld framework has already published 10 dictionaries.
</p>
<p>
For the purposes of linking linguistic data uniquely to languages, language codes are needed
for each language and each variety. For this reason, the CLLD project also comprises:</p>
<ul>
<li>
<a href="http://glottolog.org">Glottolog</a> (catalog of all languages, families and dialects, with comprehensive reference information), edited by Harald Hammarström, Martin Haspelmath & Robert Forkel</li>
</ul>
<h3>CLLD and CLDF</h3>
<p>
Arguably the most important outcome of the CLLD project was the specification of the
<a href="https://cldf.clld.org">CLDF standard</a>. CLDF provides a standard and guidelines to
store linguistic datasets as interrelated plain text files, facilitating
</p>
<ul>
<li>longterm archiving and FAIR access to such datasets via repositories like Zenodo,</li>
<li>a standardized submission format for journals such as Dictionaria,</li>
<li>simplified creation of clld applications from CLDF module-specific blueprints.</li>
</ul>
<p>
Using CLDF datasets as "input" for clld applications also solves one of the bigger problems of
publishing data in a web application: How to handle multiple versions of the data?
With CLDF, datasets can be versioned and multiple version can be published in a repository
while the web application is relegated to a browsable interface of the latest published version.
</p>