Files[]
.halguru-webscraping.yaml ➤ Pages ➤ Files
Represents a collection of files associated with the webpage for processing or extraction.
Pages:
Files:
- Name: Any text
TagName: Any text
NameRegex: Any text
UrlRegex: Any text
NameXpath: Any text
UrlXpath: Any text
Properties
Name |
Type |
Required |
Description |
Name |
Text |
✔️ |
The name of the website feature. |
TagName |
Text |
✔️ |
The tag name used to identify or categorize the website feature. |
NameRegex |
Text |
|
The regular expression pattern to identify the name component of a website feature. |
UrlRegex |
Text |
|
The regular expression pattern for matching URLs associated with the website file. |
NameXpath |
Text |
|
The XPath expression used to locate the name of a specific feature within the website content. |
UrlXpath |
Text |
|
The XPath expression used to extract the URL from a website file's content. |
Property |
Value |
Path |
Pages[].Files[] |
Internal Type |
WebScrappingModels.WebScrapingFile |
Internal Root Type |
WebScrapingConfiguration |
File Extension |
.halguru-webscraping.yaml |
JSON Schema |
halguru-webscraping-schema.json |
Last updated: | | 2025-10-13 |
Autogenerated: | | Yes |
AI powered: | | Yes |
Core version: | | 1.66.0 |