Changelog History
Page 2
-
v0.4.12 Changes
August 10, 2020๐ This is a minor bugfix release with some Dockerfile improvements to qualify for the official docker image library.
-
v0.2.4 Changes
February 27, 2019- ๐ better archive corruption guards (check structure invariants on every parse & save)
- โ remove title prefetching in favor of new FETCH_TITLE archive method
- ๐ slightly improved CLI output for parsing and remote url downloading
- โก๏ธ re-save index after archiving completes to update titles and urls
- โ remove redundant derivable data from link json schema
- ๐ markdown link parsing support
- ๐ faster link parsing and better symbol handling using a new compiled URL_REGEX
-
v0.2.3 Changes
February 19, 2019- ๐ fixed issues with parsing titles including trailing tags
- ๐ fixed issues with titles defaulting to URLs instead of attempting to fetch
- ๐ fixed issue where bookmark timestamps from RSS would be ignored and current ts used instead
- ๐ fixed issue where ONLY_NEW would overwrite existing links in archive with only new ones
- ๐ fixed lots of issues with URL parsing by using
urllib.parse
instead of hand-written lambdas - ๐ฑ ignore robots.txt when using wget (ssshhh don't tell anyone ๐)
- ๐ fix RSS parser bailing out when there's whitespace around XML tags
- ๐ fix issue with browser history export trying to run ls on wrong directory
-
v0.2.2 Changes
February 07, 2019๐ This is a bugfix release, many parts of the parsing process have been improved or fixed.
- ๐ Shaarli RSS export support
- ๐ Fix issues with plain text link parsing including quotes, whitespace, and closing tags in URLs
- โ add USER_AGENT to archive.org submissions so they can track archivebox usage
- โ remove all icons similar to archive.org branding from archive UI
- hide some of the noisier youtubedl and wget errors
- set permissions on youtubedl media folder
- ๐ fix chrome data dir incorrect path and quoting
- ๐ better chrome binary finding
- ๐ show which parser is used when importing links, show progress when fetching titles
-
v0.2.1 Changes
January 11, 2019๐ This is a feature-packed release, so it's likely to be a little buggier than usual!
๐ New features:
- ๐ ability to load any plain text list of links (also the new fallback for all parses)
- WARC file saving via wget:
FETCH_WARC=True
- Git repository downloading with git clone:
FETCH_GIT=True
GIT_DOMAINS=github.com,gitlab.com,bitbucket.org
- Media downloading with youtube-dl:
FETCH_MEDIA=True
MEDIA_TIMEOUT=36000
๐ Bugfixes:
- autodetect the correct chromium binary in almost all cases
- ๐ป create browser history export folder automatically
- higher allowed timestamp precision
๐ New logo:
-
v0.2.0 Changes
December 21, 2018No major changes other than the name.
-
v0.1.0 Changes
June 11, 2018๐ Warning: Running this version will move the old
html/
output folder to the new location:output/
.๐ Changes:
- entirely new folder structure & code layout
- ๐ moved scripts into
bin/
folder, symlinkedsetup
andarchive
for backwards-compatibility - removed TEMPLATE_INDEX* config options, just symlink the files in
templates/
to your custom versions - โ added support for
./bin/export-browser-history
JSON imports of browsing history from Chrome and Firefox
-
v0.0.3 Changes
October 30, 2017๐ New Features:
- ๐ Support for parsing links from RSS feeds
- ๐ Support for specifying a URL as well as local file paths:
./archive.py https://example.com/path/to/rss/feed.xml
- ๐ Support for
--user-data-dir
for archiving restricted sites with chrome headless - Simple & Fancy HTML & JSON indexes for each individual link
- Archive attempt history stored in link
index.json
๐ Improvements:
- Append to existing archive instead of overwriting the index each time
- โฌ๏ธ Reduced unnecessary config options, it should "just work"
- Smartly dedupe and cleanup messy archive folders
- Massively cleaned up codebase
-
v0.0.2 Changes
July 04, 2017- ๐จ refactor codebase into separate files
- check for minimum python version before running
- ๐ fix utf-8 encoding errors when writing index.html
- ๐ make index easier to customize with
templates/
folder - ๐ง WIP audio & video downloading with youtube-dl
-
v0.0.1 Changes
July 04, 2017It's reached a point where I'm comfortable bringing Bookmark Archiver out of alpha and into beta.
๐ This release supports a broad range of bookmark export files, works well with wget archiving, and produces clean, future-compatible archive folders.๐ See the README for more details and a list of features. Future releases will have a changelog.