Building a Corpus System

Tuesday, 25 March 2014

comments again

Comments crawled and search functionality added.

On the main page you can search in articles, comments, or both, with both case sensitive and case insensitive searches.
Posted by Unknown at 14:17
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments:

Post a Comment

Newer Post Older Post Home
Subscribe to: Post Comments (Atom)

Blog Archive

  • ▼  2014 (25)
    • ►  September (3)
    • ►  July (2)
    • ►  May (3)
    • ►  April (3)
    • ▼  March (14)
      • comments again
      • Comments
      • New feeds and tagging
      • Generic user-assisted feed parser
      • more deduplication
      • Deduplication and tokenizing
      • language identification and deduplication
      • duplicate removal and multithreading
      • Comments
      • Disqus Comments
      • rss, copyright, beautiful soup
      • rss, regex, dates, modules and metadata, encoding
      • better tokenizing and beginning of database design
      • iol, regex
Simple theme. Powered by Blogger.