BRWAC

brWaC is a Brazilian Portuguese corpus composed of 2.7 billion tokens, annotated with tagging and parsing information.

Source: The brWaC Corpus: A New Open Resource for Brazilian Portuguese