Add always_enqueue option to Request for bypassing deduplication #547

@vdusek

Description

  • Add an always_enqueue option (a better name is welcome, as long as it avoids negative terms) as an input parameter to the Request.from_url constructor.
    • This will allow users to easily opt out of the request deduplication process.
  • Implement the option as a convenient wrapper that generates a random unique_key, ensuring that each request is always enqueued and processed.
  • Address edge cases where both unique_key and always_enqueue=True are provided.
  • It should work in the same way as the dont_filter option in Scrapy (docs).
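
A minimal sketch of how the proposal above could behave, not the library's actual implementation. The Request class here is a simplified, hypothetical stand-in; the enqueue helper and the choice to raise ValueError when both unique_key and always_enqueue=True are given are assumptions made for illustration, since the issue leaves that edge case open.

```python
from __future__ import annotations

import uuid
from dataclasses import dataclass


@dataclass(frozen=True)
class Request:
    """Simplified, hypothetical stand-in for the real Request model."""

    url: str
    unique_key: str

    @classmethod
    def from_url(
        cls,
        url: str,
        *,
        unique_key: str | None = None,
        always_enqueue: bool = False,
    ) -> Request:
        # Assumption: conflicting inputs are rejected rather than silently resolved.
        if always_enqueue and unique_key is not None:
            raise ValueError('Pass either unique_key or always_enqueue=True, not both.')

        if always_enqueue:
            # A random key makes every request look new to the deduplication step.
            unique_key = uuid.uuid4().hex
        elif unique_key is None:
            # Assumption: the default key is just the URL; real normalization is richer.
            unique_key = url

        return cls(url=url, unique_key=unique_key)


def enqueue(queue: dict[str, Request], request: Request) -> bool:
    """Toy deduplicating queue (hypothetical): return True if the request was added."""
    if request.unique_key in queue:
        return False
    queue[request.unique_key] = request
    return True
```

With this sketch, enqueueing the same URL twice through plain from_url adds it only once, while each call with always_enqueue=True produces a distinct unique_key and is always added. Other resolutions of the conflict case, such as letting an explicit unique_key win, would be equally easy to express.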


Labels: enhancement, hacktoberfest, t-tooling
