Kalusa data files

Kalusa was an ad-hoc collaborative constructed language, started by Gary Shannon and contributed to by maybe a dozen to two dozen people over the course of a few months in Summer 2006. The best description of the project I know if is by David J. Peterson, in giving the language his Smiley Award.

Here is the original source code for the Kalusa corpus management engine. It's a 40KB ZIP file containing 2629 lines of PHP and 780 lines of HTML and CSS. In email, Gary told me that he's released it into the public domain.

Below are data files saving sentences lost from the Kalusa corpus during an apparent automated-voting attack. Gary later modified the Kalusa software to disallow multiple votes on the same sentence from the same IP address.

The format of the files is tab-delimited, with four fields:

Sentence number Kalusa sentence English gloss CQ - correctness quotient, a weighted average of people's ranking of the sentence

The original Kalusa software would, online, display the number of people who had voted on a sentence as well as its CQ, but this data was not included in the automatically generated corpora text files for download.

kalusa_corpus-2006-05-29.txt (the oldest saved corpus I have; let me know if you are interested in any of those between 5/29 and 6/12)
kalusa_corpus-2006-06-12.txt
kalusa_corpus-2006-06-14-post-massacre.txt
kalusa_corpus-2006-06-16.txt
diff-kalusa.txt - a diff of the 6/12 corpus with the 6/16 corpus
kalusa_corpus-2006-06-26.txt (let me know if you want the corpora from 6/19, 6/20, 6/21 or 6/22)
sentences-which-had-CQ-eq-100.tab as of 6/12, and were dropped in the corpus attacks a little later; re-added since
sentences-which-had-CQ-gtr-100.tab as of 6/12, and were dropped in the corpus attacks a little later; re-added since
The Saga of Malia and Kuana from the 5/29 corpus

Other Kalusa memorial sites:

Sean B. Palmer's Kalusa corpora
David J. Peterson's Kalusa review
A conlang free-for-all - the first announcement about Kalusa, and initial discussion
Kalusa: War of the Words - May 2006 thread on the CONLANG list
News from the Island of Kalu - 16 June 2006
Kalusa conlang in review - is it working? - August 2006 thread on the CONLANG list
Some comments on Kalusa in the course of a thread on "Defining 'Language'", 20 July 2007

The Perl scripts I used for automating interaction with the Kalusa engine (e.g., downloading the corpus to a text file periodically, entering a batch of new sentences I'd composed offline, etc.)

Last updated September 2010
My conlang page