BOE Data Loader
This component connects your workflow to the official Spanish State Bulletin (BOE) database. It allows you to automatically retrieve legal documents, laws, regulations, daily summaries, and auxiliary reference data. You can search for specific keywords, find laws by their unique ID, or fetch the latest updates. The component organizes this information into a structured format, making it easy to use for analysis, reporting, or feeding into AI tools for summarization and understanding.
How it Works
The component acts as a bridge between your workflow and the BOE Open Data API. When you run this component, it sends a request to the official BOE servers with the criteria you provide, such as search terms, dates, or law identifiers. The API searches the official database and returns the results.
The component then organizes this raw information into a structured format that is easy to read and use in subsequent steps of your automation. It supports features like retrieving full text, fetching daily summaries (BOE and BORME), and filtering by category or language. It also handles details like pagination and language preferences automatically.
Inputs
Mapping Mode
This component has a special mode called Mapping Mode. When you enable this toggle, your inputs become more flexible and dynamic. Each input field will offer three different ways to provide data:
- Fixed: You type the value directly into the field.
- Mapped: You connect the output of another component to use its result as the value.
- Javascript: You write Javascript code to dynamically calculate the value.
This flexibility allows you to create smarter workflows where inputs can change based on previous steps or dynamic logic.
Input Fields
The following fields are available to configure this component:
- Search Term: Keywords or phrases to filter legislation, metadata, or summaries. Leave blank to get general results.
- Norm Identifier: Unique code of the law, real decree, resolution, or disposition you are looking for. Use this for precise searches.
- Publication Date: Specific date or range of publication for BOE or BORME. Helps find laws by when they were published.
- Norm Type: Classification of the norm, such as law, decree, agreement, or resolution. Filters results by category.
- Page Number: Page number for pagination when there are many results. Controls which page of results to fetch.
- Results per Page: Maximum number of records to return per request. Controls the size of the result set.
- Language: Language for searches or translations. Default is Spanish, but you can select other supported languages.
- Include Full Text: Checkbox to request the complete text of the law. Enable this only when you need the actual content, as it may increase processing time.
- Auxiliary Reference Type: Category of auxiliary data, such as organizations, articles, or glossaries. Used for reference data instead of legislation.
Outputs
This component produces structured data that you can connect to other components for analysis, storage, or reporting.
- Legislation Data: Metadata, citations, and links of the norms found.
- Norm Text: Full textual content of the law(s) if requested.
- Daily Summaries: List of daily bulletins (BOE and BORME) with titles and descriptions.
- Reference Data: Auxiliary information like issuing organizations or legal glossaries.
- Response Metadata: Technical info about the request, such as total pages and timestamps.
- Raw Response: Original payload from the API, useful for debugging.
Output Data Example
Below is an example of the structured data this component returns. You can map fields like norm_id or norm_text to subsequent nodes.
json
{
“legislation_data”: {
“numero”: “E.RDL-1/2023”,
“titulo”: “Decreto por el que se declara…”,
“categoria”: “Real Decreto”,
“fecha”: “2023-01-15”,
“resumen”: “Publicación de normativa sobre inteligencia artificial.”,
“url”: “https://www.boe.es/…”
},
“norm_text”: “El Congreso de los Diputados ha aprobado el siguiente Real Decreto…”,
“daily_summaries”: [
{
“titulo”: “Resumen del BOE de hoy”,
“fecha”: “2023-10-27”,
“url”: “https://www.boe.es/…”
}
],
“reference_data”: {
“organismos”: [
“Ministerio de Asuntos Económicos”,
“Agencia Española de Seguridad Alimentaria”
]
},
“metadata”: {
“total_pages”: 1,
“timestamp”: “2023-10-27T10:00:00Z”,
“status”: “success”
}
}
Connectivity
This component is typically used at the beginning of a data extraction workflow. It connects to downstream components that process or store the legal data.
- AI Analysis Tools: Connect
norm_textorlegislation_datato AI components for summarization, sentiment analysis, or extracting key clauses. - Text Parsers: Connect outputs to components that parse text into structured formats like tables or databases.
- Storage: Connect
legislation_dataordaily_summariesto database writers or file storage components to save the results. - Reporting: Connect
daily_summariesorreference_datato reporting tools to generate legal updates or dashboards. - Filtering: Connect
metadatato decision nodes to check if results were found or to handle errors based on status codes.
Usage Example
Scenario: Tracking new AI regulations
You want to find and analyze new laws related to “Artificial Intelligence” published today.
-
Configure Inputs:
- Set Search Term to
Inteligencia Artificial. - Set Publication Date to
Todayor a specific date. - Leave Norm Type empty to include all categories.
- Enable Include Full Text if you need to read the law details.
- Set Search Term to
-
Execute Component:
- Run the component to fetch the data from the BOE.
-
Connect Outputs:
- Connect Norm Text to an AI Summarizer to get a quick overview of each law.
- Connect Legislation Data to a Database Writer to save the metadata for future reference.
This workflow helps you stay updated with relevant legislation and automatically analyze its content.
Tips and Best Practices
- Precision vs. Breadth: Use Norm Identifier when you know the exact law code for fast results. Use Search Term for broader queries.
- Performance: Include Full Text increases the size of the output and may take longer. Only enable it when you actually need the text content.
- Pagination: For large datasets, adjust Page Number and Results per Page to control how much data you fetch at once.
- Reference Data: Use Auxiliary Reference Type when you need glossaries or lists of organizations instead of legal texts.
- Language: Ensure Language matches your workflow needs; Spanish is default but other languages may be available for translations.
- Development Mode: This component is optimized for development. If you encounter caching or debugging behaviors, review the development settings in your workflow configuration.
Security Considerations
- This component interacts with public official data from the BOE API. No sensitive personal data is processed by this component.
- Ensure your workflow respects the rate limits of the BOE API to avoid connection issues.
- When using Include Full Text, be mindful of the data volume to maintain workflow efficiency.