Here is our LREC poster:
When using parts of this work, please cite:
@inproceedings{Buck-commoncrawl,
author = {Christian Buck and Kenneth Heafield and Bas van Ooyen},
title = {N-gram Counts and Language Models from the Common Crawl},
year = {2014},
month = {May},
booktitle = {Proceedings of the Language Resources and Evaluation Conference},
address = {Reykjavk, Iceland{i}k, Iceland}
}