WebExtrator Extract API
POST https://api.acedata.cloud/webextrator/extract
Request Headers
accept string
Specify the response format returned by the server.
authorization string
Bearer token
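The two headers above can be assembled as in the following minimal sketch. The helper name `build_headers` is hypothetical, and the default `accept` value of `application/json` is an assumption, since this page does not list the selectable options.

```python
from __future__ import annotations


def build_headers(token: str, accept: str = "application/json") -> dict[str, str]:
    """Return the two documented request headers.

    `token` stands in for your own API token. The default `accept`
    value is an assumption; the page does not list the allowed values.
    """
    if not token:
        raise ValueError("token must be a non-empty string")
    return {
        "accept": accept,
        # The authorization header carries the token with a "Bearer " prefix.
        "authorization": f"Bearer {token}",
    }
```

Passing the token as an argument keeps the credential out of the source file; it could be read from an environment variable at call time.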
Request Body
url string (required)
The URL of the webpage whose content should be extracted.
expected_type string
Indicate the expected page type to optimize extraction results.
enable_llm boolean
Whether to enable optional LLM-based semantic normalization as the final stage of the extraction process.
wait_until string
The page-load condition to wait for before extraction.
timeout number
The total timeout for the extraction operation (unit: seconds).
delay number
Additional waiting time before starting extraction after the page has finished loading (unit: seconds).
wait_for_selector string
A CSS selector that must appear on the page before extraction starts.
block_resources array
Resource types to be intercepted during page loading to speed up extraction.
headers object
Additional HTTP request headers sent along with the page request.
user_agent string
Override the default `User-Agent` request header.
callback_url string
If this parameter is provided, the request will be processed asynchronously, and the final result will be sent to this URL via a POST request.
async boolean
Whether to process the request asynchronously. When set to `true`, the interface returns a `task_id` immediately, and no `callback_url` is required; the result can then be retrieved by polling the corresponding task query interface. If `callback_url` is also provided, the result will be pushed to that callback address.
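The request body described above can be assembled and sent roughly as follows. This is a sketch under stated assumptions, not an official client: the helper names (`build_payload`, `build_request`, `extract`) are hypothetical, the body is assumed to be JSON, the `accept` value is assumed, and the response is assumed to be JSON because this page does not show its shape. Only parameters whose meaning is fully stated above are used in the example.

```python
from __future__ import annotations

import json
import urllib.request
from typing import Any

API_URL = "https://api.acedata.cloud/webextrator/extract"


def build_payload(url: str, **options: Any) -> dict[str, Any]:
    """Build the request body: `url` is required, everything else optional.

    Options set to None are dropped so the server can apply its defaults.
    """
    if not url:
        raise ValueError("url is required")
    payload: dict[str, Any] = {"url": url}
    payload.update({k: v for k, v in options.items() if v is not None})
    return payload


def build_request(token: str, payload: dict[str, Any]) -> urllib.request.Request:
    """Prepare, but do not send, the POST request."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "accept": "application/json",        # assumed value
            "authorization": f"Bearer {token}",
            "content-type": "application/json",  # assumed: JSON body
        },
        method="POST",
    )


def extract(token: str, payload: dict[str, Any], http_timeout: float = 60.0) -> Any:
    """Send the request and decode the response, assumed to be JSON."""
    request = build_request(token, payload)
    with urllib.request.urlopen(request, timeout=http_timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example body: wait for an <article> element, then one more second,
# with a 30-second overall extraction timeout, processed asynchronously.
payload = build_payload(
    "https://example.com/article",
    timeout=30,
    delay=1,
    wait_for_selector="article",
    callback_url=None,   # omitted from the body because it is None
    **{"async": True},   # `async` is a Python keyword, so pass it via a dict
)
request = build_request("YOUR_TOKEN", payload)  # placeholder token; nothing is sent here
```

Calling `extract("YOUR_TOKEN", payload)` with a real token would perform the network call. With `async` set to `true`, the documented behavior is that the response carries a `task_id` to poll with the task query interface, whose endpoint is not described on this page.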
Response
WebExtrator Web Render & Extract
Allow General Balance
When 'Allow General Balance' is enabled, the general balance is used automatically if an app's balance is insufficient.
