Skip to content

Scribe-Data 2.0.0

Compare
Choose a tag to compare
@andrewtavis andrewtavis released this 10 Oct 10:21
· 659 commits to main since this release

✨ Features

  • Scribe-Data now has the ability to download Wikipedia dumps of any language.
  • Functions have been added to parse and clean the above dumps.
  • Autosuggestions are generated from the cleaned texts by deriving most common words and those words that most commonly follow them.
  • A query for profane words has been added and integrated into the autosuggest flow to make sure that inappropriate words are not included.
    • The adjectives column has been removed from Scribe data tables until support is offered.

♻️ Code Refactoring

  • The error messages for incorrect args in update_data.py have been updated.