Hyperbrowser is a platform for running and scaling headless browsers. It lets you launch and manage browser sessions at scale and provides easy to use solutions for any webscraping needs, such as scraping a single page or crawling an entire site. Key Features:For more information about Hyperbrowser, please visit the Hyperbrowser website or if you want to check out the docs, you can visit the Hyperbrowser docs.
- Instant Scalability - Spin up hundreds of browser sessions in seconds without infrastructure headaches
- Simple Integration - Works seamlessly with popular tools like Puppeteer and Playwright
- Powerful APIs - Easy to use APIs for scraping/crawling any site, and much more
- Bypass Anti-Bot Measures - Built-in stealth mode, ad blocking, automatic CAPTCHA solving, and rotating proxies
Installation and Setup
To get started withlangchain-hyperbrowser
, you can install the package using pip:
HYPERBROWSER_API_KEY=<your-api-key>
Make sure to get your API Key from https://app.hyperbrowser.ai/
Available Tools
Hyperbrowser provides two main categories of tools that are particularly useful for:- Web scraping and data extraction from complex websites
- Automating repetitive web tasks
- Interacting with web applications that require authentication
- Performing research across multiple websites
- Testing web applications
Browser Agent Tools
Hyperbrowser provides a number of Browser Agents tools. Currently we supported- Claude Computer Use
- OpenAI CUA
- Browser Use
Browser Use Tool
A general-purpose browser automation tool that can handle various web tasks through natural language instructions.OpenAI CUA Tool
Leverages OpenAI’s Computer Use Agent capabilities for advanced web interactions and information gathering.Claude Computer Use Tool
Utilizes Anthropic’s Claude for sophisticated web browsing and information processing tasks.Web Scraping Tools
Here is a brief description of the Web Scraping Tools available with Hyperbrowser. You can see more details hereScrape Tool
The Scrape Tool allows you to extract content from a single webpage in markdown, HTML, or link format.Crawl Tool
The Crawl Tool enables you to traverse entire websites, starting from a given URL, with configurable page limits.Extract Tool
The Extract Tool uses AI to pull structured data from web pages based on predefined schemas, making it perfect for data extraction tasks.Document Loader
TheHyperbrowserLoader
class in langchain-hyperbrowser
can easily be used to load content from any single page or multiple pages as well as crawl an entire site.
The content can be loaded as markdown or html.
Advanced Usage
You can specify the operation to be performed by the loader. The default operation isscrape
. For scrape
, you can provide a single URL or a list of URLs to be scraped. For crawl
, you can only provide a single URL. The crawl
operation will crawl the provided page and subpages and return a document for each page.
params
argument. For more information on the supported params, visit https://docs.hyperbrowser.ai/reference/sdks/python/scrape#start-scrape-job-and-wait or https://docs.hyperbrowser.ai/reference/sdks/python/crawl#start-crawl-job-and-wait.