Skip to content

ATLAS (All The Locations of All Servers) - Global data center mapping project with 6,266+ verified locations across 155 countries. Comprehensive OSINT dataset with interactive world map and geospatial intelligence.

License

Notifications You must be signed in to change notification settings

hackersloveme35/ATLAS

ย 
ย 

Repository files navigation

ATLAS

All The Locations of All Servers

๐ŸŒ LIVE INTERACTIVE MAP ๐ŸŒ

A comprehensive global data center intelligence system mapping the world's critical infrastructure.

Project Origin

This OSINT scraping tool was developed as part of a critical infrastructure mapping initiative for a commercial geospatial intelligence platform. Our company was building an advanced location-based system utilizing cellular coordinates and infrastructure data to power next-generation mapping and analytics services.

Mission

Objective: Map every data center on Earth.

As critical infrastructure nodes forming the backbone of the global internet, data centers represent essential geolocation targets for our commercial database. This tool was engineered to systematically discover, validate, and extract GPS coordinates for data center facilities worldwide through open-source intelligence gathering.

What It Does

The scraper autonomously harvests data center locations from public web sources, extracting:

  • Facility names and operators
  • Physical addresses
  • Geographic coordinates
  • Infrastructure classifications

The resulting dataset populates a specialized layer within our geospatial database, enabling critical infrastructure analysis, network topology mapping, and location-based services for commercial applications.

Dataset

The scraped intelligence includes 6,266 verified data center locations across 155 countries, operated by 2,508 companies. This represents one of the most comprehensive open-source data center location databases ever compiled.

โšก Latest Enhancements (v2.2)

๐Ÿš€ Cloudflare R2 + Workers API (v2.2)

Performance & Architecture Upgrade:

  • ๐Ÿ“ฆ Migrated 1.8MB database to Cloudflare R2 object storage
  • โšก Smart filtering API reduces page loads from 1.8MB โ†’ ~100KB (18x faster)
  • ๐ŸŒ Edge caching via Cloudflare Workers for global low-latency access
  • ๐Ÿ”Œ RESTful API endpoints: /api/all, /api/search, /api/country, /api/stats
  • ๐Ÿ’ฐ Cost-effective: Nearly free operation on Cloudflare free tier
  • ๐Ÿ”’ CORS-enabled for seamless browser integration

๐ŸŽจ UI Optimization & Professional Design (v2.2)

Compact, Clean Interface:

  • ๐Ÿ“ 40% reduction in UI footprint - headers, panels, buttons all optimized
  • ๐ŸŽฏ Two-column layout: Search filters (left) | Action buttons (right)
  • ๐Ÿ“ Consistent sizing: 6px/10px padding, 11-12px fonts throughout
  • ๐Ÿ”ฒ Subtle 2px border-radius on all UI elements
  • ๐Ÿ–ผ๏ธ Clean map view: Hidden Leaflet attribution for distraction-free experience
  • โšก Better UX: Larger search box (320px), organized controls

๐Ÿ“ธ Screenshot Export with Embedded Metadata (v2.2)

Professional Map Capture:

  • ๐Ÿ–ผ๏ธ High-resolution export: 2x scale capture for crisp quality
  • ๐Ÿท๏ธ Embedded watermark: "by ringmast4r" branding in every screenshot
  • ๐Ÿ“Š Auto-embedded data: Facility count, coordinates, zoom, timestamp, URL
  • ๐Ÿ’พ PNG format: Maximum quality (1.0) with html2canvas library
  • ๐Ÿ” Hidden feature: Users just see "Screenshot" button, data embeds automatically

๐Ÿ” Massive SEO Optimization (v2.2)

Search Engine Domination:

  • ๐ŸŽฏ 500+ strategic keywords: data centers, cloud infrastructure, colocation, edge computing, CDN, etc.
  • ๐Ÿข All major providers: AWS, Azure, Google Cloud, Equinix, Digital Realty, etc.
  • ๐Ÿ“ Global tech hubs: Silicon Valley, London, Singapore, Tokyo, Sydney, Amsterdam, etc.
  • ๐Ÿ“ฑ Open Graph + Twitter Cards: Perfect social media sharing
  • โญ Schema.org structured data: Rich search results with 4.9โ˜… rating
  • ๐Ÿค– Multi-engine optimization: Google, Bing, with specialized bot directives

โšก Previous Enhancements (v2.0-2.1)

๐Ÿ”ฌ Advanced Data Cleaning & Optimization

Critical Issues Discovered & Fixed:

  • ๐Ÿ” 970 entries (15.5%) had missing country fields โ†’ Fixed 707 via intelligent address parsing (73% improvement)
  • ๐Ÿ” 1,708 US entries (83.5%) lacked state information โ†’ Added 1,434 via ZIP code geocoding
  • ๐Ÿ” Only 36.6% (2,292) had precise city-level coordinates
  • โœ… Result: Reduced invalid plotting by 73% (970 โ†’ 263 remaining)

Automated Data Cleaning Script:

  • Expanded country dictionary to 200+ countries/territories (UN members + territories)
  • Comprehensive country alias system (USAโ†’United States, UKโ†’United Kingdom, Nederlandโ†’Netherlands, etc.)
  • Special character handling (Cรดte d'Ivoire, etc.)
  • Multi-word country name support (United Arab Emirates, South Korea, etc.)
  • US state geocoding via ZIP code ranges (all 50 states + DC)
  • Coordinate validation (lat: -90 to 90, lon: -180 to 180)
  • Created optimized datacenters_cleaned.json database

๐ŸŒ Batch Geocoding & Coordinate Accuracy (v2.1)

Major Accuracy Improvement:

  • ๐ŸŽฏ Batch geocoded 3,973 facilities using OpenStreetMap Nominatim API
  • ๐Ÿ“ Added 2,119 precise coordinates (53% success rate)
  • ๐Ÿ“ˆ Coverage increased: 36.6% โ†’ 70.4% (2,293 โ†’ 4,412 facilities with coordinates)
  • ๐Ÿ”ง Fixed critical coordinate errors:
    • 261 Southern Hemisphere facilities with inverted latitude signs
    • 3 Australia facilities with UK/Europe coordinates
    • 6 facilities with completely wrong city coordinates
    • Hawaii & Iceland state/country fallback coordinate corrections

Geocoding Methodology:

  • Free OpenStreetMap Nominatim API (no cost, no API key required)
  • Address-to-coordinate conversion with ~100m accuracy
  • Country-specific boundary validation
  • Automatic retry logic and error handling
  • Rate-limited to 1 request/second (API compliance)

Result: Facilities now plot at actual street addresses instead of state/country centers (50-200 miles more accurate)

๐Ÿ“Š Interactive Statistics Dashboard

  • Real-time Analytics with Chart.js integration
  • Top 10 Countries - Bar chart showing facility distribution
  • Top 10 Operators - Bar chart of leading data center providers
  • Data Quality Metrics - Doughnut chart displaying coordinate precision stats
  • Regional Distribution - Pie chart of global infrastructure spread
  • Live Metrics - Key stats updating dynamically based on filtered data
  • Collapsible Panel - Matrix-themed UI with smooth animations

๐ŸŽจ Multi-Theme Support (5 Themes)

  • ๐ŸŸข Dark Matrix (CartoDB Dark) - Default cybersecurity aesthetic
  • โ˜€๏ธ Light Professional (OpenStreetMap) - Clean business presentation
  • ๐Ÿ›ฐ๏ธ Satellite View (ESRI World Imagery) - Real satellite photography
  • ๐Ÿ—บ๏ธ Topographic (OpenTopoMap) - Terrain and elevation mapping
  • ๐ŸŒŠ Ocean Navigation (ESRI Ocean Base) - Maritime infrastructure focus
  • CSS variable system for dynamic UI theming across all components
  • LocalStorage persistence - remembers user preference

๐Ÿ› ๏ธ Advanced Geospatial Analysis Tools

๐Ÿ“ Radius Search

  • Interactive circle drawing with click-to-place center point
  • Customizable radius (1-10,000 miles or kilometers)
  • Haversine distance calculations for accurate great-circle distances
  • Real-time facility discovery within radius
  • Results sorted by distance with top 10 display
  • Visual circle overlay with adjustable parameters

๐Ÿ“ Distance Calculator

  • Measure precise distance between any two points on the map
  • Click-to-place Point A and Point B markers
  • Support for 3 unit types: Miles, Kilometers, Nautical Miles
  • Visual line drawing between points with dashed styling
  • Real-time unit conversion
  • Auto-zoom to fit both points in view

๐ŸŽฏ Proximity Analysis

  • Click any facility to find N nearest neighbors (1-50)
  • Haversine-based distance ranking
  • Visual connection lines (top 3 highlighted in green, others in yellow)
  • Detailed neighbor information (name, company, location, distance)
  • Configurable neighbor count and distance units
  • Auto-zoom to display all related facilities

๐Ÿ’พ Export Functionality

  • CSV Export - Properly escaped, Excel-compatible format
  • JSON Export - Structured data for API/application integration
  • GeoJSON Export - Geographic format with coordinates for GIS tools
  • ๐Ÿ“ธ Screenshot Export - High-res 2x PNG with embedded metadata (facility count, coordinates, timestamp, ringmast4r branding)
  • Smart coordinate fallback system (city โ†’ state โ†’ country)
  • Timestamped filenames for version tracking
  • Export visible/filtered results only

๐Ÿš€ Performance & Visualization

  • Marker Clustering - Smart proximity grouping for 6,266 markers (dramatic performance boost)
  • Heatmap Layer - Infrastructure density visualization with Matrix-style gradient
  • Precision Mapping - Eliminated ocean/null island plotting (0,0 coordinates)
  • Enhanced UI - Matrix-themed cluster bubbles with dynamic sizing
  • Instant Filtering - Real-time search across 6,266 entries
  • Live Statistics - Dynamic country/facility count updates

Quick Stats

  • Total Data Centers: 6,266
  • Countries Covered: 155
  • Companies Tracked: 2,508
  • Top Country: United States (2,070 facilities - 33%)
  • Top Operator: Equinix (177 facilities)

See STATISTICS.md for detailed breakdowns and regional analysis.

๐ŸŽฎ How to Use the Interactive Map

Basic Navigation:

  • Search Bar - Type facility name, company, city, state, or country
  • Filters - Use dropdown menus to filter by country or company
  • Zoom - Scroll or pinch to zoom, click clusters to expand

Advanced Tools (Toolbar Buttons):

  1. ๐Ÿ”ฅ Toggle Heatmap - Switch between marker view and density heatmap
  2. ๐Ÿ“Š Statistics Dashboard - View interactive charts and analytics
  3. ๐ŸŽจ Theme Selector - Choose from 5 map themes (dropdown menu)
  4. ๐Ÿ’พ Export Data - Download visible facilities in CSV, JSON, or GeoJSON format
  5. ๐Ÿ“ Radius Search - Click button โ†’ Click map โ†’ Set radius โ†’ View facilities within distance
  6. ๐Ÿ“ Distance Calculator - Click button โ†’ Click Point A โ†’ Click Point B โ†’ View distance
  7. ๐ŸŽฏ Proximity Analysis - Click button โ†’ Click any facility marker โ†’ View N nearest neighbors

Tips:

  • Click any facility marker to view detailed information
  • Use filters before exporting to get specific subsets
  • Proximity Analysis shows top 3 neighbors highlighted in green
  • Theme preference is saved automatically

Files

Data Files

  • datacenters_cleaned.json โญ NEW - Optimized dataset with country/state extraction and coordinate validation
  • datacenters_processed.csv - Processed dataset with parsed address fields (CSV format)
  • datacenters_original_scraped.csv - Original scraped data (reference)
  • datacenters.json - JSON format for API/application integration with structured address data

Interactive Tools

  • index.html - Live interactive world map with:
    • โœจ Marker Clustering - Smart proximity grouping for 6,266 facilities
    • ๐Ÿ”ฅ Heatmap Layer - Density visualization with Matrix-style gradient
    • ๐Ÿ” Advanced Search - Real-time filtering (name, company, city, state, country)
    • ๐Ÿ“Š Statistics Dashboard - 4 interactive charts (Chart.js)
    • ๐ŸŽจ Multi-Theme Support - 5 map themes (Dark Matrix, Light, Satellite, Topographic, Ocean)
    • ๐Ÿ’พ Export Tools - CSV/JSON/GeoJSON download of filtered data
    • ๐Ÿ“ Radius Search - Find facilities within customizable distance from any point
    • ๐Ÿ“ Distance Calculator - Measure between two points (miles/km/nautical miles)
    • ๐ŸŽฏ Proximity Analysis - Find N nearest neighbors to any facility
    • ๐ŸŽฏ Country/Company Filtering - Dropdown filters for precise queries
    • ๐Ÿ“ Interactive Results Panel - Click-through facility details

Utilities

  • clean_data.py โญ NEW - Data cleaning script with country/state extraction and coordinate validation

Documentation

  • STATISTICS.md - Comprehensive statistics and regional breakdowns
  • README.md - This file
  • LICENSE - Usage terms

Data Schema

CSV Format (datacenters_processed.csv):

name,company,city,administrative_area,country,address

JSON Format (datacenters.json):

[
  {
    "name": "NAP de las Americas Madrid",
    "company": "Terremark",
    "city": "Madrid",
    "country": "Spain",
    "address": "Calle de Yecora, 4 28009 Madrid Spain"
  }
]

Field Descriptions:

  • administrative_area - First-level administrative division (e.g., US states, Canadian provinces, UK counties, German Lรคnder, French rรฉgions)
  • The JSON format includes additional fields like street, state, zip, and city_coords for internal map processing

Sample Records:

NAP de las Americas Madrid,Terremark,Madrid,,Spain,"Calle de Yecora, 4 28009 Madrid Spain"
Handy Networks Denver,Handy Networks,Denver,Colorado,United States,"1801 California St, Suite 240 Denver"
Central Office 2,StarHub Ltd.,Singapore,,Singapore,19 Tai Seng Dr 535222 Singapore Singapore
Toronto,Allied Properties REIT,Toronto,Ontario,Canada,151 Front Street Toronto Canada

Usage Examples

Python - Load CSV:

import csv

with open('datacenters_processed.csv', 'r', encoding='utf-8') as f:
    reader = csv.DictReader(f)
    for row in reader:
        print(f"{row['name']} - {row['city']}, {row['country']}")

Python - Load JSON:

import json

with open('datacenters.json', 'r', encoding='utf-8') as f:
    datacenters = json.load(f)

# Filter by country
us_datacenters = [dc for dc in datacenters if dc.get('country') == 'United States']

# Filter by city
london_datacenters = [dc for dc in datacenters if dc.get('city') == 'London']

# Filter by company
equinix_facilities = [dc for dc in datacenters if 'Equinix' in dc.get('company', '')]

# Search across all fields
search_term = 'miami'
results = [dc for dc in datacenters
           if search_term.lower() in str(dc.get('address', '')).lower() or
              search_term.lower() in str(dc.get('city', '')).lower()]

JavaScript - Fetch and Query:

fetch('datacenters.json')
  .then(response => response.json())
  .then(data => {
    // Find all datacenters in a country
    const ukDatacenters = data.filter(dc => dc.country === 'United Kingdom');

    // Find all datacenters in a US state
    const californiaDatacenters = data.filter(dc => dc.state === 'California');

    // Find all datacenters in a city
    const nycDatacenters = data.filter(dc => dc.city === 'New York');

    // Get unique countries
    const countries = [...new Set(data.map(dc => dc.country))];

    // Get unique US states
    const states = [...new Set(data.filter(dc => dc.state).map(dc => dc.state))];
  });

Command Line - Query with jq:

# Count by country
jq 'group_by(.country) | map({country: .[0].country, count: length})' datacenters.json

# Count by state (US only)
jq '[.[] | select(.state != "")] | group_by(.state) | map({state: .[0].state, count: length})' datacenters.json

# Find specific operator
jq '.[] | select(.company | contains("Equinix"))' datacenters.json

# Extract all facilities in a state
jq '.[] | select(.state == "Florida")' datacenters.json

# Extract all facilities in a city
jq '.[] | select(.city == "Miami")' datacenters.json

Coverage

Global Coverage by Region:

  • North America: 2,265 facilities (36.1%)
  • Europe: 1,778+ facilities (28.4%)
  • Asia-Pacific: 783+ facilities (12.5%)
  • Africa: 179+ facilities (2.9%)
  • South America: 183+ facilities (2.9%)
  • Middle East: 86+ facilities (1.4%)

Top 10 Countries:

  1. United States (2,070)
  2. United Kingdom (461)
  3. Netherlands (296)
  4. France (261)
  5. Germany (242)
  6. Australia (181)
  7. Canada (164)
  8. India (147)
  9. Brazil (141)
  10. China (138)

License

All Rights Reserved. This dataset may not be used, copied, modified, or distributed without explicit written permission from the author.


Built for commercial geospatial intelligence. Powered by OSINT methodologies.

About

ATLAS (All The Locations of All Servers) - Global data center mapping project with 6,266+ verified locations across 155 countries. Comprehensive OSINT dataset with interactive world map and geospatial intelligence.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 78.1%
  • Python 21.9%