Integrate worker pool for parallel downloading of assets #99
Reference
thwap/hoover#99
Phase 2 of the roadmap includes "Download assets in parallel" using a worker pool. The worker pool implementation exists in `internal/workerpool` but is not integrated into the archiver's fetching loop.
Currently, the archiver fetches pages sequentially. To improve performance, we should use the worker pool to fetch pages (and later assets) concurrently, respecting rate limiting and depth constraints.
This task involves modifying the `Archive` method in `internal/archiver/archiver.go` to use the worker pool for parallel downloading.
Acceptance criteria:
This issue completes the remaining item of Phase 2.
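
For discussion, the concurrent fetch loop described above could look roughly like the following sketch. The API of `internal/workerpool` is not shown in this issue, so the sketch uses plain goroutines and a channel instead; `crawl`, `job`, `fetchFunc`, and `demo` are hypothetical names, and the shared `time.Ticker` is only one assumed way to enforce the rate limit. It is meant to illustrate depth tracking and a clean shutdown, not the final `Archive` implementation.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
	"time"
)

// job is one page to fetch, tagged with its link distance from the start URL.
type job struct {
	url   string
	depth int
}

// fetchFunc returns the links found on a page. It is a hypothetical
// stand-in for the archiver's real fetch-and-parse step.
type fetchFunc func(url string) []string

// crawl fetches every page reachable from start within maxDepth links,
// using a fixed number of workers and at most one fetch per interval
// across all workers. interval must be positive. It returns the sorted
// list of fetched URLs.
func crawl(start string, maxDepth, workers int, interval time.Duration, fetch fetchFunc) []string {
	var (
		mu      sync.Mutex
		pending sync.WaitGroup // counts jobs queued but not yet finished
	)
	visited := map[string]bool{start: true}
	jobs := make(chan job)
	ticker := time.NewTicker(interval)
	defer ticker.Stop()

	// enqueue sends from a separate goroutine so that a worker never
	// blocks on the channel while every other worker is busy.
	enqueue := func(j job) {
		pending.Add(1)
		go func() { jobs <- j }()
	}

	for i := 0; i < workers; i++ {
		go func() {
			for j := range jobs {
				<-ticker.C // shared rate limit
				links := fetch(j.url)
				if j.depth < maxDepth {
					mu.Lock()
					for _, l := range links {
						if !visited[l] {
							visited[l] = true
							enqueue(job{url: l, depth: j.depth + 1})
						}
					}
					mu.Unlock()
				}
				pending.Done()
			}
		}()
	}

	enqueue(job{url: start, depth: 0})
	pending.Wait() // all queued jobs have been received and processed
	close(jobs)    // lets the workers exit

	pages := make([]string, 0, len(visited))
	for u := range visited {
		pages = append(pages, u)
	}
	sort.Strings(pages)
	return pages
}

// demo crawls a small in-memory site graph with a depth limit of 2.
func demo() []string {
	site := map[string][]string{
		"/":  {"/a", "/b"},
		"/a": {"/c"},
		"/b": {"/c", "/"},
		"/c": {"/d"}, // "/d" would be at depth 3, so it is never fetched
	}
	fetch := func(u string) []string { return site[u] }
	return crawl("/", 2, 3, time.Millisecond, fetch)
}

func main() {
	fmt.Println(demo()) // prints [/ /a /b /c]
}
```

The `WaitGroup` counts outstanding jobs rather than workers, which is what allows the job channel to be closed safely once no page can produce further work. In the real integration, the worker goroutines would presumably be replaced by whatever submission mechanism `internal/workerpool` provides.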