Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generate fallback URLs for web archive links #804

Merged
merged 3 commits into from
Aug 29, 2024

Conversation

sissbruecker
Copy link
Owner

Removes logic that tries to find an existing snapshot through the web archive CDX API as a fallback when creating a new snapshot fails, or when importing bookmarks that have been added some time ago. The API's rate limit is hard to work around with the current architecture and causes issues with snapshot tasks being queued for a long time.

Instead, generate a fallback URL from the bookmark's URL and added timestamp. That should automatically display the nearest snapshot for that timestamp. Worst case a website might not have any snapshot, in which case the URL would be invalid.

Fixes #747

@sissbruecker sissbruecker merged commit 749bc1e into master Aug 29, 2024
2 checks passed
@sissbruecker sissbruecker deleted the fix/generate-fallback-web-archive-urls branch August 29, 2024 20:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Internet archive huey task should not keep retrying forever
1 participant