NEWS
archiveRetriever 0.4.0 (2024-06-11)
- Replace deprecated functions of dependencies
- Fix bugs in archive_overview() and retrieve_urls()
- New option nonArchive added to retrieve_links() and scrape_urls(). This option allows users to scrape web pages that do not come from the Internet Archive.
- New feature added to the collapse option of scrape_urls(). collapse can now also take an XPath as input, to collapse results based on a structuring XPath. This works only with XPath expressions, not with CSS selectors. If used, the Paths argument refers only to children of the structuring XPath given in collapse.
archiveRetriever 0.3.1 (2022-12-23)
- Changes to the testing environment.
- Disable progress bar in non-interactive use.
archiveRetriever 0.3.0 (2022-12-20)
- Fixes to the filtering of links in retrieve_links() to enable link scraping from domains with more than one domain ending.
- New option filter added to retrieve_links(). This option allows users to disable the filtering that restricts links to sub-domains of the top-level domain.
- New option pattern added to retrieve_links(). This option allows for custom patterns by which links are filtered before output.
archiveRetriever 0.2.0 (2022-06-21)
- New option collapseDate added to retrieve_urls(). This option allows users to choose whether retrieve_urls() outputs all mementos or just one memento per requested day.
archiveRetriever 0.1.2 (2022-06-07)
- Fixes to the ignoreErrors option for HTML reading errors in scrape_urls()
- Fixes to retrieve_links() for errors occurring in the last URL
- Improve compatibility between retrieve_links() and scrape_urls()
archiveRetriever 0.1.1 (2022-03-03)
- Fixes to ignoreErrors option for encoding errors in retrieve_links()
archiveRetriever 0.1.0 (2021-05-27)
- Fixes to function behavior in case of timeout.
- Changes to the preliminary output printed by scrape_urls() in case of error.
- More flexibility in using the attachto option of scrape_urls().
- New option collapse added to scrape_urls(). This option allows users to choose whether HTML elements retrieved via archiveRetriever are collapsed into a single observation or kept as separate observations in the output dataset.
- Update of the package documentation.
archiveRetriever 0.0.2 (2021-03-19)
- Minor fixes and adjustments to error messages
- More stable testing environment
archiveRetriever 0.0.1 (2021-03-10)
- Final version for CRAN submission
archiveRetriever 0.0.0.9000
- Added a NEWS.md file to track changes to the package.