Files[]
.halguru-webscraping.yaml ➤ Pages[] ➤ Files[]
Represents a collection of files associated with the webpage for processing or extraction.
Pages:
- Files:
- Name: Any text
TagName: Any text
NameRegex: Any text
UrlRegex: Any text
NameXpath: Any text
UrlXpath: Any text
| Name |
Description |
| Title |
Files |
| Field Type |
ObjectList |
| Required |
True |
Field List
| Name |
Type |
Required |
Description |
| Name |
Text |
✔️ |
The name of the website feature. |
| TagName |
Text |
✔️ |
The tag name used to identify or categorize the website feature. |
| NameRegex |
Text |
|
The regular expression pattern to identify the name component of a website feature. |
| UrlRegex |
Text |
|
The regular expression pattern for matching URLs associated with the website file. |
| NameXpath |
Text |
|
The XPath expression used to locate the name of a specific feature within the website content. |
| UrlXpath |
Text |
|
The XPath expression used to extract the URL from a website file's content. |
| Property |
Value |
| Path |
Pages[].Files[] |
| Internal Type |
WebScrappingModels.FileItem |
| Internal Root Type |
WebScrapingHalGuru |
| File Extension |
.halguru-webscraping.yaml |
| JSON Schema |
halguru-webscraping-schema.json |
| Last updated: | | 2026-03-19 |
| Autogenerated: | | Yes |
| AI powered: | | Yes |
| Core version: | | 1.93.0 |