OpenStreetMap Integration #236
base: master
Conversation
…gaete/powerplantmatching into feature_osm_perf_improvements
OSM Integration Example

I've added an example script demonstrating the new OSM module functionality.

Example Script:

```python
import os

import matplotlib.pyplot as plt

# Import paths are assumptions: OSM and GEM are assumed to live in
# powerplantmatching.data and the plotting helpers in powerplantmatching.plot.
from powerplantmatching import get_config
from powerplantmatching.data import GEM, OSM
from powerplantmatching.plot import (
    fueltype_and_country_totals_bar,
    fueltype_totals_bar,
)

# Main execution
if __name__ == "__main__":
    # Set up output directory
    output_dir = "outputs"
    os.makedirs(output_dir, exist_ok=True)

    # List of countries to process
    countries = [
        "Chile",
        "South Africa",
        "Indonesia",
    ]

    # Get the base configuration
    config = get_config()
    config["main_query"] = ""
    config["target_countries"] = countries
    config["OSM"]["force_refresh"] = False
    config["OSM"]["plants_only"] = False
    config["OSM"]["units_clustering"]["enabled"] = False
    config["OSM"]["missing_name_allowed"] = False
    config["OSM"]["missing_technology_allowed"] = False
    config["OSM"]["missing_start_date_allowed"] = True

    # Get combined data for all countries
    osm_data = OSM(raw=False, update=False, config=config)
    gem_data = GEM(raw=False, update=False, config=config)

    fig, axis = fueltype_totals_bar([gem_data, osm_data], keys=["GEM", "OSM"])
    plt.savefig(os.path.join(output_dir, "osm_gem_ppm.png"))

    fig, axis = fueltype_and_country_totals_bar(
        [gem_data, osm_data], keys=["GEM", "OSM"]
    )
    plt.savefig(os.path.join(output_dir, "osm_gem_ppm_country.png"))
```

Results:

The example generates two plots comparing OSM and GEM data. These results show that OSM provides data quality comparable to specialized sources like GEM for major generation types (Hard Coal, Hydro, Natural Gas), with particularly strong alignment in South Africa. The OSM module successfully captures most major power plant types while also providing additional data for some categories (such as Oil and Waste) that may be missing in other sources. The differences highlight opportunities for enhancement, particularly for Wind and Geothermal power plants, which show lower coverage in OSM. This will be covered in the coming tasks, where the focus is data validation and code revisions towards data quality enhancement.
Next Steps for OSM Implementation

Data Validation Across Multiple Countries: the next phase will focus on thorough data validation.

Documentation and Tutorials: to support the implementation, development of supporting materials will include:
- Document comprehensive OpenStreetMap data source with advanced processing
- Highlight dual purpose for energy analysts and OSM community
- Include details on caching, enhancement features, and quality control
…en-energy-transition/powerplantmatching into feature_osm_perf_improvements
@cdgaete I have tested your example and it all works impressively well. Would you mind fixing the remaining failing tests (it is probably just fixing a deprecation; we can also drop support for Python 3.9 if needed)? Then we can finally merge.
Force-pushed 4f9a6f5 to 614713e (compare)
Breaking changes:
- Minimum Python version is now 3.10 (was 3.9)
- Remove mypy and all type checking infrastructure

Changes:
- Update requires-python to >=3.10 in pyproject.toml
- Remove Python 3.9 from classifiers
- Delete type-checking.yml GitHub workflow
- Remove mypy and type stub dependencies (mypy, types-requests, types-PyYAML, pandas-stubs, types-tqdm, types-six)
- Remove [tool.mypy] configuration section from pyproject.toml
- Update release notes to document breaking changes
- Update isinstance calls to use Python 3.10+ union syntax (X | Y)
Force-pushed 66327d7 to 749e3fc (compare)
- Replace Optional[X] with X | None throughout OSM module
- Update isinstance calls to use X | Y syntax
- Apply ruff formatting for consistency with Python 3.10+ standards
Yes, I would like to have a look, too. Before merging, we need to understand what impact it has on the default `powerplants.csv`.
Pull Request Overview
This pull request adds comprehensive OpenStreetMap (OSM) integration capabilities to powerplantmatching, enabling extraction of power plant data from the global OpenStreetMap database. The implementation provides a robust data processing pipeline with caching, quality tracking, and analysis tools.
- Implements a complete OSM data extraction pipeline with hierarchical processing (relations → ways → nodes)
- Adds sophisticated multi-level caching system to minimize API calls and improve performance
- Introduces capacity estimation, plant reconstruction, and rejection tracking for data quality improvement
Reviewed Changes
Copilot reviewed 43 out of 45 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| pyrightconfig.json | Adds Pyright type checker configuration for basic type checking |
| pyproject.toml | Updates Python version requirement to 3.10+, adds new dependencies (scikit-learn, shapely), removes mypy configuration |
| powerplantmatching/plot.py | Removes large commented code blocks, adds new plotly_map function for interactive mapping |
| powerplantmatching/package_data/config.yaml | Adds comprehensive OSM configuration section with API settings, source mappings, and processing parameters |
| powerplantmatching/osm/workflow.py | Implements main workflow orchestration for OSM data processing pipeline |
| powerplantmatching/osm/utils.py | Provides utility functions for capacity parsing, country validation, and cache path management |
| powerplantmatching/osm/retrieval/regional.py | Implements regional download functionality for custom geographic areas |
| powerplantmatching/osm/retrieval/populate.py | Provides cache population utilities for batch processing multiple countries |
| powerplantmatching/osm/retrieval/client.py | Core Overpass API client with retry logic, caching, and progress tracking |
| powerplantmatching/osm/retrieval/cache.py | Multi-level caching system for OSM elements and processed units |
| powerplantmatching/osm/retrieval/__init__.py | Package initialization for retrieval module |
| powerplantmatching/osm/quality/rejection.py | Comprehensive rejection tracking system for data quality analysis |
| powerplantmatching/osm/quality/coverage.py | Cache coverage analysis and maintenance tools |
| powerplantmatching/osm/quality/__init__.py | Package initialization for quality module |
| powerplantmatching/osm/parsing/plants.py | Parser for OSM power plant relations with reconstruction capabilities |
Comments suppressed due to low confidence (2)
pyproject.toml:13
- Removing Python 3.9 support is a breaking change. Consider documenting this in release notes or maintaining backward compatibility if existing users depend on Python 3.9.
license = { file = "LICENSE" }
powerplantmatching/osm/parsing/plants.py:434
- Using `assert` for runtime validation in production code is problematic. The assertion could be disabled with `python -O`. Consider raising a proper exception like `ValueError` or `RuntimeError` instead.
mismatch_ratio = (
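The suggested change can be sketched as follows; `check_mismatch`, its arguments, and the threshold are hypothetical names for illustration, not the PR's actual code.

```python
def check_mismatch(matched: int, total: int, threshold: float = 0.5) -> float:
    """Validate with exceptions instead of assert, so checks survive `python -O`."""
    if total <= 0:
        raise ValueError(f"total must be positive, got {total}")
    mismatch_ratio = (total - matched) / total
    if mismatch_ratio > threshold:
        raise RuntimeError(
            f"mismatch ratio {mismatch_ratio:.2f} exceeds threshold {threshold}"
        )
    return mismatch_ratio
```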
```python
        return False, None, "regex_error"
else:
    if regex_patterns is None:
        regex_patterns = [
```
Copilot AI (Jul 31, 2025)
The regex patterns for capacity parsing are hardcoded. Consider moving these to configuration to make them easily customizable without code changes.
I would not move these away, but add a comment what each regex intends to match
```python
while retries < self.max_retries:
    try:
        response = requests.post(
            self.api_url, data={"data": query}, timeout=self.timeout + 30
```
Copilot AI (Jul 31, 2025)
The timeout value is calculated as self.timeout + 30 but this could result in very long timeouts if self.timeout is large. Consider setting a maximum timeout limit to prevent indefinite waits.
```diff
- self.api_url, data={"data": query}, timeout=self.timeout + 30
+ self.api_url, data={"data": query}, timeout=min(self.timeout + 30, MAX_TIMEOUT)
```
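A tiny sketch of the capped timeout, with `MAX_TIMEOUT` as a hypothetical constant (the suggestion does not specify a value):

```python
MAX_TIMEOUT = 300  # seconds; hypothetical upper bound


def effective_timeout(base: int, buffer: int = 30) -> int:
    """Cap the request timeout so a large base value cannot wait indefinitely."""
    return min(base + buffer, MAX_TIMEOUT)
```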
```python
def _save_cache(self, cache_path: str, data: dict) -> None:
    """Save dictionary to JSON cache file."""
    cache_data = data if data else {}
```
Copilot AI (Jul 31, 2025)
[nitpick] The condition `data if data else {}` can be simplified to `data or {}` for better readability.
```diff
- cache_data = data if data else {}
+ cache_data = data or {}
```
```python
rejections_to_process.extend(rejections)

for rejection in rejections_to_process:
    if rejection.coordinates is None or "cluster" in rejection.id.lower():
```
Copilot AI (Jul 31, 2025)
Using string matching with "cluster" in rejection.id.lower() is fragile. Consider adding a proper flag or enum value to identify cluster rejections more reliably.
It does not have an effect on the default powerplants.csv, as it is not added to the default matching sources. So at this stage it is purely about adding an optional source. However, @cdgaete, would you mind making a short comparison of the powerplants.csv with and without OSM (preferably for Europe)?
maurerle
left a comment
This is huge.
It increases the code base by a multiple of its current size and is hard to review.
Files which are not essential to the features of this PR are changed here as well.
I made some code remarks for better readability.
Recommendations to split functions or reduce the scope of classes (by not including the config everywhere).
.pre-commit-config.yaml (outdated)

```yaml
# - repo: https://github.com/RobertCraigie/pyright-python
#   rev: v1.1.383
#   hooks:
#     - id: pyright
```
I would remove this unrelated change
unrelated change
```python
# GNU General Public License for more details.

# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
```
instead of full removal, maybe add SPDX-Headers?
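For example, an SPDX header could look like the sketch below; the exact identifier depends on the project's license terms, and `GPL-3.0-or-later` is an assumption based on the GNU header being removed here.

```python
# SPDX-FileCopyrightText: Contributors to powerplantmatching
#
# SPDX-License-Identifier: GPL-3.0-or-later
```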
```python
logger.info(f"Total countries to process: {len(all_countries)}")

if dry_run:
    print("\nDry run - would download the following countries:")
```
Use logger instead of printing?
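A minimal sketch of the suggested change (function and logger names are illustrative):

```python
import logging

logger = logging.getLogger("powerplantmatching.osm.populate")  # illustrative name


def announce_dry_run(countries: list[str]) -> None:
    # logger.info instead of print, so output respects the logging configuration
    logger.info("Dry run - would download the following countries:")
    for name in countries:
        logger.info("  %s", name)
```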
```python
total_elements = total_plants + total_generators

if check_live_counts:
```
maybe extract this branch into a separate function which updates the passed cached_countries?
```python
)
cached_countries[country_code]["cache_status"] = "error"

if return_data:
```
instead of spending another 100 lines of code in this function, I would rather create a function which works on the output dict and just handles the printing.
That way, we can spare the return_data parameter as well and receive cleaner code.
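One way to sketch that split, with illustrative names: one function builds the summary dict, another only handles the printing.

```python
def summarize_coverage(cached_countries: dict[str, dict]) -> dict[str, int]:
    """Count countries per cache_status."""
    summary: dict[str, int] = {}
    for info in cached_countries.values():
        status = info.get("cache_status", "unknown")
        summary[status] = summary.get(status, 0) + 1
    return summary


def print_coverage(summary: dict[str, int]) -> None:
    """Presentation only; callers that want the data use summarize_coverage."""
    for status, count in sorted(summary.items()):
        print(f"{status}: {count}")
```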
Looks good.
```python
    Minimum similarity ratio for common substrings
    """

def __init__(self, config: dict[str, Any]):
```
This class only needs `name_similarity_threshold` - why give it the whole config?
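The suggested narrowing could be sketched like this: the class takes only the value it uses, with an optional convenience constructor for callers holding a config dict. The class name and config key are illustrative, not the PR's actual code.

```python
class NameMatcher:
    """Illustrative class that depends on one threshold, not the full config."""

    def __init__(self, name_similarity_threshold: float):
        self.name_similarity_threshold = name_similarity_threshold

    @classmethod
    def from_config(cls, config: dict) -> "NameMatcher":
        # Convenience constructor keeps config knowledge in one place
        return cls(config["OSM"]["name_similarity_threshold"])
```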
```python
def __init__(
    self,
    config: dict[str, Any],
```
This class only needs `min_generators_for_reconstruction` - why give it the whole config?
…fig management

- Refactor classes to accept specific parameters instead of full config objects
- Split large functions in coverage.py into focused, single-purpose functions
- Replace print statements with proper logging throughout codebase
- Add SPDX license headers to all new files
- Replace runtime assertions with proper exception handling
- Enhance caching with diskcache for better memory management
- Add comprehensive European power plant analysis (39K plants, 850+ GW OSM contribution)
- Improve error handling and CSV corruption recovery
- Add omitted countries support for API limitations
Response to Review Feedback

Thank you @maurerle, @FabianHofmann, @fneum, and @lkstrp for the thorough reviews and constructive feedback. I've made substantial improvements to address the core concerns raised about code organization, configuration management, and maintainability. Disclaimer: the huge jump in lines of code is associated with an HTML report in the documentation.

@maurerle's Review Comments

- Configuration Dependencies Reduction:
- Function Decomposition:
- Logging Instead of Print Statements:
- License Headers:
- Unrelated Changes:

@copilot Comments

- Runtime Assertions:
- Breaking Changes Documentation:
- Code Simplification:
- Regex Pattern Documentation:

Additional Architectural Improvements

- Enhanced Caching System:
- Improved Error Handling:
- Configuration Management:
- Documentation and European Power Plant Analysis (addressing @FabianHofmann and @fneum's request for understanding OSM impact):

Comments Evaluated but Not Implemented

- Timeout Management (Copilot AI suggestion):
- String Matching for Rejection Types (Copilot AI suggestion):

Viewing the EU Analysis Report

The comprehensive European analysis is available in multiple formats:

- OSM_Data_Coverage_Analysis_documentation.pdf
- Interactive Map: clone the branch and open …
- Static Analysis: the downloadable script …
- Documentation: see …

Please let me know if any specific areas need further clarification or if you'd like me to address any of the remaining optimization suggestions.
@cdgaete could you use scatter clustering for easier viewing of your map?
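One lightweight way to thin out map markers is grid-based aggregation before plotting; the sketch below is an illustration of that idea, not the PR's actual `plotly_map` implementation, and the cell size in degrees is an assumption.

```python
from collections import defaultdict


def cluster_points(
    points: list[tuple[float, float]], cell: float = 1.0
) -> list[tuple[float, float, int]]:
    """Group (lat, lon) points into grid cells; return (lat, lon, count) per cell."""
    cells: dict[tuple[int, int], list[tuple[float, float]]] = defaultdict(list)
    for lat, lon in points:
        # Bucket each point by its grid cell
        cells[(int(lat // cell), int(lon // cell))].append((lat, lon))
    # One centroid marker per cell, sized by the number of points it represents
    return [
        (
            sum(p[0] for p in pts) / len(pts),
            sum(p[1] for p in pts) / len(pts),
            len(pts),
        )
        for pts in cells.values()
    ]
```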
I don't think I will have enough time. I am fine as long as it's not in the default matching sources, but generally share the sentiment of @maurerle that this PR is basically a package on its own (given its size) and impossible to review (and possibly maintain) in detail. What I liked about powerplantmatching so far is that it took in some pre-processed data without the need to connect to APIs etc. I think it's worth considering the option to outsource this to another repository and then work with a pre-processed extract of OSM data. Keeps the scope of powerplantmatching perhaps a bit smaller.
lkstrp
left a comment
What I liked about powerplantmatching so far is that it took in some pre-processed data without the need to connect to APIs etc. I think it's worth considering the option to outsource this to another repository and then work with a pre-processed extract of OSM data. Keeps the scope of powerplantmatching perhaps a bit smaller.
I agree, provided this remains as bloated as it is. @cdgaete, I'm afraid the code and comments in this PR are mostly copied and pasted agent output. They're very verbose, contain a lot of meaningless stuff and could be summarised to 1/10th of length. I always expected this to get cleaned up further. The feature might be cool, but I don't see it anywhere near getting merged in ppm.
If this gets cleaned up, I can add a lot more review comments (next to those of @maurerle, which haven't been resolved yet), and many of those are obvious. But this will need more work. Otherwise we can bring this to another repo and import just the dataset into ppm, as proposed by fneum.
Thanks for the detailed feedback. You're right: the OSM implementation has grown into a package of its own. What started as a straightforward data source integration revealed OSM's complexity: community-built data with numerous edge cases requiring individual handling to extract reliable power plant information.

Regarding the code: while I used AI assistance for code and documentation, I'm the one architecting and refining every implementation detail. The scope is genuinely large: ~8k lines for the OSM core logic, ~9k for the EU visualization, plus documentation. This isn't verbosity; it reflects the extensive feature set needed to handle OSM's data quality challenges. The OSM module includes enhancement features (cache validation, regional coverage report) that could be removed for the core integration, but might be valuable if we spin this into a separate package.

@lkstrp I carefully addressed maurerle's initial review; I believe all comments were considered and responded to. If there are specific unresolved items, please point me to them so I can check what's missing.

Given the feedback from @fneum and @lkstrp converging on the same point, I agree the cleanest path forward is a separate repository. This would let us maintain OSM's complexity independently while providing powerplantmatching with pre-processed, clean datasets. If we opt for this option, I can put a Zenodo link to the resulting OSM datasets and implement here in PPM a script that downloads and integrates them into the pipeline, as the other datasets already do.
That option sounds great! Thanks @cdgaete, and sorry that we've been so critical!


Closes #12
Supersedes #225
Changes proposed in this Pull Request
Current Implementation Focus
The OSM module has been designed with several key architectural components:
Comprehensive Data Processing Pipeline: A structured workflow that processes OSM elements based on hierarchy (relations, ways, nodes), with specialized handling for each element type
Advanced Geometric Analysis: Utilizes spatial relationships to identify overlapping power plants and prevent duplication
Multi-level Data Extraction: Implements a cascading approach to extract capacity information:
Efficient Caching Architecture: Implemented a multi-tiered caching system that significantly reduces API requests and improves performance, enabling quick analysis of large geographic areas
Rejection Tracking System: The rejection tracker serves two critical purposes:
Benefits
Checklist