In this work we introduce the Spanish Literary corpus MegaLite, a new corpus well adapted to
Natural Language Processing (NLP), Computational Creativity (CC), text generation and
other studies. We address the creation of this corpus of literary documents to evaluate or
design algorithms in automatic text generation, classification, stylometry and rhetorical
analysis, and sentiment detection, among other tasks. We built this corpus manually in
order to avoid genre classification errors. The corpus comprises nearly 5,200 works in the
genres of narrative, poetry and plays. Some statistics and applications of the MegaLite corpus are
presented and discussed. The MegaLite corpus will be available to the community as a free
resource, in several suitable formats.