This concordance of the rongorongo corpus was produced using
Barthel's numerical transcription of the corpus, as cross-checked,
corrected, and made
available by the C.E.I.P.P. (Centre d'Études sur l'Île de
Pâques et la Polynésie).
It replaces the concordance uploaded here on 21 October 2001,
which was produced from C.E.I.P.P. data files dating from late
1997, which still contained many uncorrected mistakes out of Barthel's
corpus. All the major texts have now been checked against Barthel's
original rubbings and corrected by the C.E.I.P.P. Rongorongo Commission
who is currently processing line 6 of the Santiago Staff. When the
Santiago Staff has been processed only minor texts, such as that on the
Snuff Box, will remain to be transliterated.
- Download these two ZIP files:
concord1.zip (250,143 bytes) and
concord2.zip (229,017 bytes). The file concord1.zip
contains the concordance for signs 001 to 079, split into 6 files;
concord2.zip contains signs 080 to 791, split into 8 files (there are no
signs beyond 791).
- Unzip them. They unzip into 14 files, about 1.0M in total size,
each with a ".ccd" extension. These are straight ASCII text,
each line terminated with a CRLF. They can be browsed with any
text editor capable of handling files up to at least 125k, which is
the size of the largest file, 001.ccd, which contains signs
001 to 004.
Each sign is announced by the word "Sign" followed by its Barthel
number, and its number of occurrences. Then, for each occurrence,
the sign is displayed, one per line, in its context. The signs
have been stripped of leading zeroes so that more could be displayed
on each line. Thus below 001 appears as just 1, 006 as 6, 093 as 93.
- At the beginning of each line is a single uppercase letter
identifying the object, followed by the side identifier: r
for recto, v for verso, or a for side a
and b for side b (when the order of reading could not
be determined). Then comes the line number and the position of
the sign on the line. Thus Ra08.16: tablet R, side
a, line 8, 16th sign.
- Worn-out signs are represented by a 0, and sequences of
worn-out signs by notations of the form (5-8), i.e. from 5 to
8 signs unrecognizable for wear. For the purpose of counting sign
positions, such a sequence counts as one sign only.
- Exclamation marks follow signs remaining to be definitely
identified, question marks those of doubtful identification.
- Asterisks represent ends of lines, thus in Aa07.129 below, you
can see that sign 139V occurred right at the end of line Aa07 (line 7, side
a of tablet A, also known as Tahua or The Oar), with sign 599 starting the
- In each line, a pipe (|) precedes the sign which is
the object of the concordance, so that it can be located
easily. For the other letters and punctuation marks, see
the explanation Barthel's notation system
and the C.E.I.P.P.'s
Extended Barthel System
Sign 004, 410 occurrences:
- Sorting Order. For sorting purposes punctuation (hyphens,
colons, etc.) was ignored and so were sign suffixes (the letters
that often occur following a sign number), but leading zeroes were
used. Sequences of illegible signs, e.g. (5-8), were sorted as if
consisting of a single 000 code. Thus this sequence of signs:
was sorted as if it had been:
001 000 000 005 006 007 093 050 074 008 400 281 001
- Implementation. The concordance was coded in
using the same algorithm as Jacques Guy's Monkey
program, the description of which is available in the
archives of the Voynich Interest Group. Those archives
now occupying several megabytes, finding this algorithm
is quite difficult, and so it is reproduced
here. If you are interested
in undeciphered writings and in the methodology of
decipherment, these two sites are recommended:
by Dennis Stallings and
by René Zandbergen.