A powerful automation tool that enriches property listings by finding complete addresses, including house numbers and street names, through EPC data matching. This scraper boosts data accuracy and enhances real-estate analytics with AI-assisted matching capabilities.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
This project identifies and appends full property addresses to existing Rightmove listing data by cross-matching EPC information. It solves the common issue of incomplete listing addresses and is ideal for analysts, real-estate researchers, data engineers, and automation pipelines.
- Uses EPC databases to identify precise full property addresses.
- Supports AI-powered matching for higher accuracy.
- Automatically enriches existing datasets with street-level details.
- Adds useful metadata like EPC match status and possible matches.
- Optionally populates sold price history from Land Registry data.
| Feature | Description |
|---|---|
| Full Address Extraction | Identifies detailed property addresses using EPC certificates. |
| AI-Powered Matching | Uses AI models to improve EPC matching accuracy and broaden coverage. |
| EPC Match Status | Adds a clear epcMatched indicator showing confidence level. |
| Sales History Enrichment | Optionally pulls historical sold prices for matched properties. |
| Possible Address Suggestions | Provides alternative address options when certainty is lower. |
| Flexible Export Formats | Supports JSON, CSV, Excel, XML, RSS, and HTML. |
| Field Name | Field Description |
|---|---|
| fullAddress | The complete address including house number and street. |
| epcMatched | Indicates EPC match accuracy (none, approx, accurate). |
| possibleAddresses | List of potential matching addresses if multiple candidates found. |
| priceHistory | Array of year-price records from sold property history. |
| epc | EPC certificate URL or PDF source used for extraction. |
| propertyType | Property classification used for filtering. |
| outcode | Postal outcode used to narrow EPC search. |
| incode | Postal incode for more precise matching. |
| threads | Number of concurrent processing threads. |
| populateHistory | Whether sales history was appended. |
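
The field table above can be turned into a small sanity check for downstream code. The following is a minimal sketch, not part of the project: the function name `validateRecord` is hypothetical, and the rule that an `accurate` match should carry a `fullAddress` string is an assumption based on the field descriptions.

```javascript
// Allowed values for epcMatched, as listed in the field table.
const EPC_MATCH_VALUES = ["none", "approx", "accurate"];

/**
 * Hypothetical helper: returns a list of problems found in one output record.
 * An empty list means the record passed these illustrative checks.
 */
function validateRecord(record) {
  const problems = [];
  if (!EPC_MATCH_VALUES.includes(record.epcMatched)) {
    problems.push(`unexpected epcMatched: ${record.epcMatched}`);
  }
  // Assumption: an "accurate" match is expected to include a fullAddress string.
  if (record.epcMatched === "accurate" && typeof record.fullAddress !== "string") {
    problems.push("accurate match without a fullAddress string");
  }
  // possibleAddresses is optional, but should be a list when present.
  if (record.possibleAddresses !== undefined && !Array.isArray(record.possibleAddresses)) {
    problems.push("possibleAddresses should be an array");
  }
  return problems;
}
```

Running this over each record before loading it into a database is one way to catch schema drift early.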
```json
{
  "epcMatched": "accurate",
  "fullAddress": "4 Faringdon Road, Swindon, Wiltshire, SN1 5BJ",
  "possibleAddresses": [
    "4 Faringdon Road, Swindon, Wiltshire, SN1 5BJ",
    "6 Faringdon Road, Swindon, Wiltshire, SN1 5BJ"
  ]
}
```
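
Once records like the sample above are loaded, a consumer will often want to count matches by status and set aside the uncertain ones. This is a rough sketch under stated assumptions: `summarizeMatches` is a hypothetical name, and treating a missing or unknown `epcMatched` value as `none` is a choice made for this example only.

```javascript
const MATCH_LEVELS = ["none", "approx", "accurate"];

/**
 * Hypothetical helper: counts records per epcMatched level and collects
 * non-accurate records that still offer candidate addresses to review.
 */
function summarizeMatches(records) {
  const counts = { none: 0, approx: 0, accurate: 0 };
  const needsReview = [];
  for (const record of records) {
    // Assumption: anything outside the documented values is counted as "none".
    const level = MATCH_LEVELS.includes(record.epcMatched) ? record.epcMatched : "none";
    counts[level] += 1;
    const candidates = Array.isArray(record.possibleAddresses) ? record.possibleAddresses : [];
    if (level !== "accurate" && candidates.length > 0) {
      needsReview.push(record);
    }
  }
  return { counts, needsReview };
}
```

The `needsReview` list is a natural input for a manual check or a second matching pass.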
```
Rightmove EPC Full Address Finder/
├── src/
│   ├── runner.js
│   ├── extractors/
│   │   ├── epc_matcher.js
│   │   ├── address_resolver.js
│   │   └── ai_matching.js
│   ├── outputs/
│   │   └── exporters.js
│   └── config/
│       └── settings.example.json
├── data/
│   ├── sample_input.json
│   └── sample_output.json
├── package.json
└── README.md
```
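
The contents of `settings.example.json` are not shown in this README, so the following is only a sketch of how settings might be merged with defaults and validated. The key names `threads` and `populateHistory` are taken from the field table; the default values and the `resolveSettings` function are hypothetical.

```javascript
// Hypothetical defaults. The key names mirror fields listed in this README,
// but the real settings.example.json may use different keys or values.
const DEFAULT_SETTINGS = {
  threads: 4,
  populateHistory: false,
};

/**
 * Hypothetical helper: merges user settings over the defaults and rejects
 * values that could not work (for example, zero threads).
 */
function resolveSettings(userSettings = {}) {
  const settings = { ...DEFAULT_SETTINGS, ...userSettings };
  if (!Number.isInteger(settings.threads) || settings.threads < 1) {
    throw new RangeError("threads must be a positive integer");
  }
  if (typeof settings.populateHistory !== "boolean") {
    throw new TypeError("populateHistory must be true or false");
  }
  return settings;
}
```

In practice the `userSettings` object would come from something like `JSON.parse(fs.readFileSync(path, "utf8"))`.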
- Property analysts use it to enrich datasets with missing full addresses, enabling more accurate market research.
- Estate agencies automate address verification to maintain clean, standardized property databases.
- Data scientists integrate it into pipelines to add location granularity for modeling and prediction tasks.
- Investors use the enriched data to analyze sales history and identify trends in specific neighborhoods.
- Automation engineers chain it with other scrapers to build end-to-end property intelligence systems.
Q: Does this scraper guarantee a full address for every listing? A: Not always. Full address extraction depends on the availability and quality of EPC records associated with the listing.
Q: Is AI matching required to use this tool? A: No, AI is optional. However, enabling it increases match accuracy and returns more valid full addresses.
Q: Can I extract sold history for properties?
A: Yes. When populateHistory is enabled, the scraper retrieves sold price data and appends it as priceHistory.
Q: Will this work with existing property data pipelines? A: Yes. Output can be exported as JSON, CSV, Excel, XML, RSS, or HTML, so most existing pipelines can ingest it directly.
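
As an illustration of working with `priceHistory`, the sketch below computes the overall price change between the earliest and latest recorded sale. The README only describes the entries as "year-price records", so the property names `year` and `price` are assumptions, and `summarizePriceHistory` is a hypothetical helper.

```javascript
/**
 * Hypothetical helper. Assumed entry shape: { year: 2015, price: 180000 }.
 * Returns null when there is no history to summarize.
 */
function summarizePriceHistory(priceHistory) {
  if (!Array.isArray(priceHistory) || priceHistory.length === 0) {
    return null;
  }
  // Sort a copy by year so the caller's array is left untouched.
  const sorted = [...priceHistory].sort((a, b) => a.year - b.year);
  const first = sorted[0];
  const last = sorted[sorted.length - 1];
  const changePercent = ((last.price - first.price) / first.price) * 100;
  return { firstYear: first.year, lastYear: last.year, changePercent };
}
```

A record with no sold history, or one produced with `populateHistory` disabled, would simply yield `null` here.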
Primary Metric: Processes approximately 20,000 properties for around $10 of compute (roughly $0.0005 per property), with AI matching increasing address resolution accuracy by up to 40%.
Reliability Metric: Over 92% successful EPC lookup rate when EPC data is available, with stable performance across large property datasets.
Efficiency Metric: Threaded processing reduces runtime and keeps throughput high even on modest compute environments.
Quality Metric: Delivers consistently high data completeness, with precise full addresses for properties where EPC mapping is available, and clear fallback fields for lower-confidence cases.
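
To show how a `threads`-style limit can bound concurrent lookups, here is a minimal sketch of a concurrency-limited map. It is not the project's implementation: it uses an async worker pool on a single Node.js thread rather than OS threads, and `mapWithConcurrency` and the `worker` callback are hypothetical names.

```javascript
/**
 * Hypothetical helper: runs worker(item, index) for every item, with at most
 * `limit` calls in flight at once. Results keep the order of the input.
 */
async function mapWithConcurrency(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0; // index of the next unclaimed item

  // Each runner claims items one at a time until none are left.
  async function runner() {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index], index);
    }
  }

  const runnerCount = Math.max(1, Math.min(limit, items.length));
  await Promise.all(Array.from({ length: runnerCount }, () => runner()));
  return results;
}
```

A call such as `mapWithConcurrency(listings, settings.threads, lookupAddress)` would then process listings in bounded parallel batches, where `lookupAddress` stands in for whatever per-listing lookup is used.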
