Skip to content

Support connecting to browser using connect() instead of launch() #892

Open
@Kilowhisky

Description

@Kilowhisky

Is your feature request related to a problem? Please describe.
When trying to automate the process of scraping i've found that trying to run the local browser inside a lambda that handles multiple sessions causes problems with the browser when its built into the app.

Describe the solution you'd like
Playwright lets you setup a remote server and connect to it using web sockets.

see here for example:
https://github.com/pixelfactoryio/playwright-server

I would like to see a configuration that lets me specify that it should connect to one of these servers instead of trying to launch the browser locally.

see here for how to connect() instead of launch() the browser:
https://playwright.dev/python/docs/api/class-browsertype#browser-type-connect

Describe alternatives you've considered
I've played around with scrape_do and browser_base and they both have some drawbacks and sometimes its best to have total control over the browser that is being launched.

Additional context
There seems to be some bugs when trying to run scrapegraph in a long running process that handles multiple requests. It would be just easier if I can place the chrome server on a machine with a GPU and let scrapegraph just control it remotely.

Metadata

Metadata

Assignees

No one assigned

    Labels

    feature requeststaleIssue has not had recent activity or appears to be solved. Stale issues will be automatically closed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions