Find search terms in the Web Corpus
and similar data sources

Search with wildcard for terms occurring in the specified dataset applying selected filters
 

Symbols
    *  wildcard 0-... chars
    ?  wildcard 1 char
    |  OR
    ~  NOT
Leave query blank to return 50 random items 

Data Source Filters
Dataset
 1-grams Web Corpus 2006
"dirty" 1-grams, 1,420,092 types
1-grams Web Corpus 2007, release 1
3,123,996 types, 518,129,710 tokens
  CUVPlus
forms of lemmas occurring in the Oxford Advanced Learner's Dictionary of Current English in David Hardcastle's extension of CUVPlus; UK spelling only.
  SPECIALIST
(bio)medical terms as well as many general wordforms in the National Institutes' of Health National Library of Medicine's SPECIALIST lexicon.
  both preceding wordlists
  BNC
over half a million wordforms in the British National Corpus
  CUVPlus, SPECIALIST nand BNC
merged UPPER-CASE / lower-case 1-grams, 4,994,732 types
  Urban Dictionary
259,717 entries on 13 June 2007, many with multiple definitions

Max. types to show: 1000  | 5000  | 10,000  | 50,000

Feedback and suggestions encouraged.
http://webascorpus.org/searchseeds.html ver. 2 August 2007