<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">
<br class="">
<div>
<div class="">
<div style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">
Dear friends and colleagues,<br class="">
<div class=""><br class="">
<div class="">
<div class="">I'm pleased to announce the release of two parsed corpora of Old and Middle</div>
<div class="">
<div class="">French, containing a total of roughly 1.6 million words:<br class="">
<br class="">
- Modéliser le changement: Voies de français (MCVF) (ca. 850,000 words)<br class="">
- Penn-BFM Parsed Corpus of Historical French (PPCHF) (ca. 750,000 words)<br class="">
<br class="">
The MCVF is distributed in two versions:<br class="">
<br class="">
- version 1.0, the original 2009 release (with XML, tagged, and parsed files)<br class="">
- version 2.0, containing revised and corrected versions of the parsed files<br class="">
<br class="">
The PPCHF contains additional texts, especially from the Early Old French </div>
<div class="">period before 1200. More than half of the material is based on online editions</div>
<div class="">published by the Base de Français Médiéval (BFM).</div>
<div class=""><br class="">
The corpora are distributed under the Creative Commons </div>
<div class="">Attribution-NonCommercial-ShareAlike 4.0 International license (CC BY-NC-SA 4.0; </div>
<div class=""><a href="https://creativecommons.org/licenses/by-nc-sa/4.0/" class="">https://creativecommons.org/licenses/by-nc-sa/4.0/</a>), which allows you to</div>
<div class="">add information of your own to the data (say, lemmatization). The corpora </div>
<div class="">can be downloaded at</div>
<div class=""><br class="">
<a href="https://github.com/beatrice57/mcvf-plus-ppchf/" class="">https://github.com/beatrice57/mcvf-plus-ppchf/</a></div>
<div class=""><br class="">
The following information is included in the GitHub repository, but is also </div>
<div class="">available online for stand-alone reference, so that you can get a sense of </div>
<div class="">the contents before downloading:</div>
<div class=""><br class="">
Source information about the texts<br class="">
<br class="">
<a href="https://www.ling.upenn.edu/~beatrice/corpus-ling/french-corpora-sources/index.html" class="">https://www.ling.upenn.edu/~beatrice/corpus-ling/french-corpora-sources/index.html</a></div>
<div class=""><br class="">
Annotation guidelines<br class="">
<br class="">
<a href="https://www.ling.upenn.edu/~beatrice/corpus-ling/annotation-french" class="">https://www.ling.upenn.edu/~beatrice/corpus-ling/annotation-french</a></div>
<div class=""><br class="">
</div>
<div class=""><br class="">
</div>
<div class="">Best regards,</div>
<div class=""><br class="">
</div>
<div class="">Beatrice Santorini</div>
<div class=""><br class="">
</div>
<br class=">
</div>
</div>
</div>
<br class="">
</div>
</div>
</div>
<br class="">
</body>
</html>