Skip to content

Files[]

.halguru-webscraping.yamlPages[]Files[]

Represents a collection of files associated with the webpage for processing or extraction.

Pages:
  - Files:
      - Name: Any text
        TagName: Any text
        NameRegex: Any text
        UrlRegex: Any text
        NameXpath: Any text
        UrlXpath: Any text

Field Information#

Name Description
Title Files
Field Type ObjectList
Required True

Field List#

Name Type Required Description
Name Text ✔️ The name of the website feature.
TagName Text ✔️ The tag name used to identify or categorize the website feature.
NameRegex Text The regular expression pattern to identify the name component of a website feature.
UrlRegex Text The regular expression pattern for matching URLs associated with the website file.
NameXpath Text The XPath expression used to locate the name of a specific feature within the website content.
UrlXpath Text The XPath expression used to extract the URL from a website file's content.

Technical Information#

Property Value
Path Pages[].Files[]
Internal Type WebScrappingModels.FileItem
Internal Root Type WebScrapingHalGuru
File Extension .halguru-webscraping.yaml
JSON Schema halguru-webscraping-schema.json

.halguru-webscraping.yaml#

Reference Index#

Configuration Files#


Last updated: 2026-03-19
Autogenerated: Yes
AI powered: Yes
Core version: 1.93.0