Docspell v0.4.0 Release Notes

Release Date: 2020-03-28 // about 4 years ago
  • Mar. 29, 2020

    • ๐Ÿ‘Œ Support for archive files. Archives, for example zip files, contain the files that should go into docspell. Docspell now extracts archives and adds the content to an item. The extraction process is recursive, so there may be zip files in zip files. File types supported:
      • zip every file inside is added to one item as attachment
      • eml (RCF822 E-Mail files) E-mails are considered archives, since they may contain multiple files (body and attachments).
    • ๐Ÿ“„ Periodic Tasks framework: Docspell can now run tasks periodically based on a schedule. This is not yet exposed to the user, but there are some system cleanup jobs to start with.
    • ๐Ÿ‘Œ Improvement of the text analysis. For my test files there was a increase in accuracy by about 10%.
    • A due date that is found during text analysis is now set on the item. If multiple due dates are found, the earliest one is used.
    • ๐Ÿ‘ Allow to switch between viewerjs PDF viewer or the browser's builtin viewer.
    • ๐Ÿ› Bug fixes related to handling text files.
    • โž• Add a configurable length limit for text analysis

    ๐Ÿ”ง Configuration Changes

    ๐Ÿ”ง The configuration of the joex component has been changed.

    • ๐Ÿ†• new section docspell.joex.periodic-scheduler for configuring the periodic scheduler
    • ๐Ÿ†• new section docspell.joex.house-keeping for configuring house-keeping tasks
    • ๐Ÿ†• new section docspell.joex.text-analysis for configuring the new size limit
    • The command for running wkhtmltopdf changed in that the encoding is now added at runtime.

    REST Api Changes

    The REST Api has some additions:

    • ๐Ÿ†• new route to retrieve the archive file
    • โž• add field in ItemDetail data that refers to the archive files of the attachments