Files[]

Represents a collection of files associated with the webpage for processing or extraction.

Pages:
  Files:
    - Name: Any text
      TagName: Any text
      NameRegex: Any text
      UrlRegex: Any text
      NameXpath: Any text
      UrlXpath: Any text

Properties

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| Name | Text | ✔️ | The name of the website feature. |
| TagName | Text | ✔️ | The tag name used to identify or categorize the website feature. |
| NameRegex | Text | | The regular expression pattern to identify the name component of a website feature. |
| UrlRegex | Text | | The regular expression pattern for matching URLs associated with the website file. |
| NameXpath | Text | | The XPath expression used to locate the name of a specific feature within the website content. |
| UrlXpath | Text | | The XPath expression used to extract the URL from a website file's content. |
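
As an illustration, the fragment below sketches what two filled-in entries might look like: one that locates a file with XPath expressions and one that uses regular expressions. Every value (names, tags, patterns) is hypothetical and chosen only for this example. How the scraper combines the regex and XPath properties, and what each pattern is matched against, is not stated here, so treat this as an assumption-laden sketch rather than a verified configuration. The nesting mirrors the skeleton at the top of this page; since the documented path is Pages[].Files[], Pages may itself need to be a list in a real file.

```yaml
Pages:
  Files:
    # Hypothetical entry: locate PDF links with XPath expressions.
    - Name: Annual report
      TagName: report
      NameXpath: "//a[contains(@href, '.pdf')]/text()"
      UrlXpath: "//a[contains(@href, '.pdf')]/@href"
    # Hypothetical entry: match CSV downloads with regular expressions.
    - Name: Price list
      TagName: pricing
      NameRegex: '[^/]+\.csv'
      UrlRegex: 'https?://\S+\.csv'
```

The regular expressions are single-quoted so that YAML leaves the backslashes untouched, and the XPath expressions are double-quoted because they contain single quotes.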

Technical Information

| Property | Value |
| --- | --- |
| Path | Pages[].Files[] |
| Internal Type | WebScrappingModels.WebScrapingFile |
| Internal Root Type | WebScrapingConfiguration |
| File Extension | .halguru-webscraping.yaml |
| JSON Schema | halguru-webscraping-schema.json |

Last updated: 2025-10-13
Autogenerated: Yes
AI powered: Yes
Core version: 1.66.0