Files[]
.halguru-webscraping.yaml ➤ Pages ➤ Files
Represents a collection of files associated with the webpage for processing or extraction.
Pages:
Files:
- Name: Any text
TagName: Any text
NameRegex: Any text
UrlRegex: Any text
NameXpath: Any text
UrlXpath: Any text
Properties
| Name |
Type |
Required |
Description |
| Name |
Text |
✔️ |
The name of the website feature. |
| TagName |
Text |
✔️ |
The tag name used to identify or categorize the website feature. |
| NameRegex |
Text |
|
The regular expression pattern to identify the name component of a website feature. |
| UrlRegex |
Text |
|
The regular expression pattern for matching URLs associated with the website file. |
| NameXpath |
Text |
|
The XPath expression used to locate the name of a specific feature within the website content. |
| UrlXpath |
Text |
|
The XPath expression used to extract the URL from a website file's content. |
| Property |
Value |
| Path |
Pages[].Files[] |
| Internal Type |
WebScrappingModels.FileHalItem |
| Internal Root Type |
WebScrapingHalConfiguration |
| File Extension |
.halguru-webscraping.yaml |
| JSON Schema |
halguru-webscraping-schema.json |
| Last updated: | | 2025-12-05 |
| Autogenerated: | | Yes |
| AI powered: | | Yes |
| Core version: | | 1.75.0 |