The SoNaR Corpus has a newspaper component (WR-P-P-G) containing nearly 15 million sentences. See also Reference_corpora.