
v0.2.99

@D4Vinci D4Vinci released this 08 Apr 04:42
· 13 commits to main since this release
96e3c3d

This update is essential for everyone who wants to fully enjoy Scrapling as intended.

What's changed

New full documentation website

  • Yup, finally 😄 Check it out here

Unified import logic for fetchers

  • You can now import all fetchers with from scrapling.fetchers import Fetcher, AsyncFetcher, StealthyFetcher, PlayWrightFetcher, then use them directly, e.g. page = Fetcher.get(...), without initializing them first.
    This replaces the old import: from scrapling.defaults import Fetcher, AsyncFetcher, StealthyFetcher, PlayWrightFetcher.
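As a rough illustration of the new import path, here is a minimal sketch. The helper name fetch_page and the example URL are hypothetical and not part of Scrapling; only the import path and the Fetcher.get(...) call come from the notes above. It assumes Scrapling v0.2.99 or later is installed.

```python
def fetch_page(url):
    """Fetch a URL with the class-level Fetcher API (hypothetical helper)."""
    # Imported lazily so this module still loads where Scrapling is absent.
    # New unified import path introduced in this release:
    from scrapling.fetchers import Fetcher

    # No Fetcher() initialization is needed; get() is called on the class.
    return Fetcher.get(url)


# Usage (needs Scrapling installed and network access):
# page = fetch_page("https://example.com")
```

Code that still uses from scrapling.defaults import ... should keep working for now, since the old imports are left usable with a warning.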

Breaking change: automatch is now turned off by default

  • Now there's new logic to enable automatch from fetchers or other parsing options. Check out the documentation page for details.

Old imports and logic are left usable with a warning for backward compatibility.

New options added to fetchers

  • Both StealthyFetcher and PlayWrightFetcher now accept a new wait argument when fetching. It makes the fetcher wait for the given period (in milliseconds) before closing the page and returning the response.
  • The StealthyFetcher methods fetch and async_fetch now accept an additional_arguments argument, which is passed to Camoufox as additional settings and takes priority over Scrapling's own settings (#54)
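The two new options could be combined roughly as follows. This is a sketch under assumptions: the helper names build_fetch_options and fetch_with_options are hypothetical, and the "locale" key in the usage comment is only an illustrative Camoufox setting; check the Camoufox documentation for the settings it actually accepts. Only the wait and additional_arguments argument names and StealthyFetcher.fetch come from the notes above.

```python
def build_fetch_options(wait_ms, camoufox_overrides=None):
    """Build keyword arguments for StealthyFetcher.fetch (hypothetical helper)."""
    # wait: how long to pause, in milliseconds, before the page is closed.
    options = {"wait": wait_ms}
    if camoufox_overrides:
        # Passed to Camoufox as extra settings; per the release notes these
        # take priority over Scrapling's own settings.
        options["additional_arguments"] = dict(camoufox_overrides)
    return options


def fetch_with_options(url, wait_ms=2000, camoufox_overrides=None):
    """Fetch a page with the new options (requires Scrapling v0.2.99+)."""
    # Imported lazily so the helper above can be used without Scrapling.
    from scrapling.fetchers import StealthyFetcher

    return StealthyFetcher.fetch(url, **build_fetch_options(wait_ms, camoufox_overrides))


# Usage (needs Scrapling installed and network access; "locale" is an
# assumed example of a Camoufox setting):
# page = fetch_with_options("https://example.com", 1500, {"locale": "en-US"})
```

Keeping the option building separate from the fetch call makes it easy to inspect exactly what will be passed before any browser is launched.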

Bugs squashed

  • Fixed a bug in async_fetch, in both the StealthyFetcher and PlayWrightFetcher classes, related to catching redirections.

Thanks for all your support and donations!


Big shoutout to our biggest Sponsor: Scrapeless
