“Duplicate” Entries in Gazetteers
Jordan Hastings
Department of Geography
University of California, Santa Barbara

Gazetteer “Duplicates” Names & Features (1)
  • Naming Features in the Environment
      • Linguistic Necessity
      • Identity and Ownership
      • Navigation and Wayfinding
  • Features Cover a Large Territory
      • Crisp or Diffuse
      • Compact or Extended
      • Tangible or Abstract

Gazetteer “Duplicates” Names & Features (2)
  • Locations are Numerous & Various
      • Multiscale
      • Generalized
      • Dis-coordinated
      • Time-variant

Gazetteer “Duplicates” Names & Features (3)
  • Names are Numerous & Various
      • Polynymous
      • Misspelled
      • Multilingual
      • Time-variant

Gazetteer “Duplicates” Names & Features (4)
  • Lake Bigler, thru 1920s
  • Lake Bonpland (also Bondland), thru 1890s
  • Da-ow-a-ga, thru 1850s

Gazetteer “Duplicates” Feature Types (1)
  • Dependable Type System
      • Because Features are “Objects”
      • Because Human Mind Categorizes
  • Types present in Taxonomy
      • Hierarchy is Natural in Environment
      • Because Human Mind Categorizes

Gazetteer “Duplicates” Feature Types (2) – Examples
  • Cultural Environment
      • Nations -> States -> Provinces -> Districts

Gazetteer “Duplicates” Feature Types (2) – Examples
  • Physical Environment
      • Watersources: Springs -> Seeps
      • Watercourses: Rivers -> Streams -> Creeks
      • Waterbodies: Lakes -> Ponds -> Sloughs
      • ? Glaciers
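
The type chains on this slide suggest a simple way to measure how close two feature types are: count the steps between them in the hierarchy. The sketch below is only an illustration. It assumes that each arrow runs from a broader type to a narrower one and that each chain hangs under its group name (Watersources, Watercourses, Waterbodies); the names PARENT, path_to_root and type_distance are hypothetical, not part of the presentation.

```python
# Hypothetical parent links read off the slide's chains.
# Assumption: "A -> B" means B is the next-narrower type under A.
PARENT = {
    "Springs": "Watersources",
    "Seeps": "Springs",
    "Rivers": "Watercourses",
    "Streams": "Rivers",
    "Creeks": "Streams",
    "Lakes": "Waterbodies",
    "Ponds": "Lakes",
    "Sloughs": "Ponds",
}


def path_to_root(ftype):
    """Return the chain of types from ftype up to the top of its hierarchy."""
    path = [ftype]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def type_distance(a, b):
    """Number of hierarchy steps between two types, or None if unrelated."""
    path_a = path_to_root(a)
    depth_in_b = {t: i for i, t in enumerate(path_to_root(b))}
    for steps_from_a, t in enumerate(path_a):
        if t in depth_in_b:
            # t is the nearest shared ancestor (or one of the two types).
            return steps_from_a + depth_in_b[t]
    return None
```

Under these assumptions, type_distance("Lakes", "Ponds") is 1, while types from different groups, such as "Seeps" and "Creeks", have no shared ancestor and return None.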

Gazetteer “Duplicates” Fundaments (1)
  • Definition: Gazetteer
      • A spatial dictionary of named & typed features in the environment
  • Implications
      • Features uniquely identified
      • Searchable by name and type
      • Also searchable geospatially
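
The three implications above (unique identity, search by name and type, geospatial search) can be illustrated with a very small in-memory structure. This is a minimal sketch, not the presenter's design: the Feature and Gazetteer classes, their fields, and the sample records (names and coordinates alike) are invented for illustration, and the bounding-box search ignores boxes that cross the antimeridian.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    feature_id: str  # unique key, so same-named features stay distinct
    name: str
    ftype: str       # feature type, e.g. "Creeks"
    lon: float       # representative point, decimal degrees
    lat: float


class Gazetteer:
    def __init__(self):
        self._features = {}

    def add(self, feature):
        """Store a feature; identifiers must be unique."""
        if feature.feature_id in self._features:
            raise ValueError(f"duplicate id: {feature.feature_id}")
        self._features[feature.feature_id] = feature

    def find(self, name=None, ftype=None):
        """Case-insensitive name match and/or exact type match."""
        hits = []
        for f in self._features.values():
            if name is not None and f.name.casefold() != name.casefold():
                continue
            if ftype is not None and f.ftype != ftype:
                continue
            hits.append(f)
        return hits

    def within(self, west, south, east, north):
        """Features whose point falls inside a lon/lat bounding box."""
        return [f for f in self._features.values()
                if west <= f.lon <= east and south <= f.lat <= north]


# Invented sample records: two distinct creeks share a name.
gaz = Gazetteer()
gaz.add(Feature("f1", "Mill Creek", "Creeks", -120.10, 39.00))
gaz.add(Feature("f2", "Mill Creek", "Creeks", -118.50, 36.20))
gaz.add(Feature("f3", "Mill Pond", "Ponds", -120.11, 39.01))
```

Here gaz.find(name="mill creek") returns both creeks, which is why a name match alone cannot decide whether two entries are duplicates.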

Gazetteer “Duplicates” Fundaments (2)
  • Duplicates: An approximate notion
      • Firm types, ±close in hierarchy
      • Locations ±close dependent on scale
      • Names ±close dependent on language … or not at all
      • All aspects variant in time

Gazetteer “Duplicates” Fundaments (3)
  • Database Implications / Support
      • Custom Datatypes
          • Hierarchy
          • Geometry
      • Multiple Attribution (unlimited)
          • Names
          • Locations
      • Efficient Geospatial Processing

Gazetteer “Duplicates” Approach (1)
  • Independent Measures of Duplicates
      1. Type Thesaurus Metrics
          • Inter-feature: hierarchy, explicit linkages
      2. Geospatial Metrics
          • Intra-feature: size, compactness, …
          • Inter-feature: distance, overlap, …
      3. Geonomial Metrics
          • Intra-feature: NL translation [not considered yet]
          • Intra-feature: stemming, soundex, substitution
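
Two of the measures named above, inter-feature distance and soundex, have well-known textbook forms, sketched here for concreteness. The slides do not say which formulas were actually used, so the choices below are assumptions: great-circle (haversine) distance on a sphere of radius 6371 km, standard American Soundex, and, as a stand-in for "substitution", the similarity ratio from Python's difflib. The function names are hypothetical.

```python
import math
from difflib import SequenceMatcher

# Standard American Soundex letter groups.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name):
    """Four-character Soundex code, or "" if name has no letters."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    out = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            out += code
        prev = code  # vowels reset prev, so a repeated code counts again
    return (out + "000")[:4]


def name_similarity(a, b):
    """Character-level similarity in [0, 1] (difflib ratio, case-insensitive)."""
    return SequenceMatcher(None, a.casefold(), b.casefold()).ratio()


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points in decimal degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dlat = p2 - p1
    dlon = math.radians(lon2 - lon1)
    a = (math.sin(dlat / 2) ** 2
         + math.cos(p1) * math.cos(p2) * math.sin(dlon / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(a))
```

Applied to the variant spellings from the earlier slide, soundex("Bonpland") and soundex("Bondland") come out different (the P and D fall in different Soundex groups), while name_similarity scores the pair as close. That is one reason to keep several name measures rather than rely on a single one.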

Gazetteer “Duplicates” Approach (2)
  • Unified Assessment of Duplicates
      • Weighted Combination of Measures
          1. Type
          2. Location(s)
          3. Name(s)
      • Geographic Visualization, over Maps
      • Final Authority of Human Cataloger
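
A weighted combination of the three measures could be as simple as a normalized weighted average. The sketch below assumes each measure has already been scaled to a similarity between 0 and 1; the function names, the default weights, and the thresholds are all hypothetical, since the slides give no values. In keeping with the last bullet, the result is only a suggestion for the human cataloger.

```python
def combined_score(type_sim, location_sim, name_sim, weights=(1.0, 1.0, 1.0)):
    """Weighted average of type, location and name similarities (each 0..1)."""
    sims = (type_sim, location_sim, name_sim)
    if any(not 0.0 <= s <= 1.0 for s in sims):
        raise ValueError("similarities must lie in [0, 1]")
    if len(weights) != 3 or any(w < 0 for w in weights) or sum(weights) == 0:
        raise ValueError("need three non-negative weights, not all zero")
    return sum(w * s for w, s in zip(weights, sims)) / sum(weights)


def suggest(score, likely_at=0.8, unlikely_below=0.3):
    """Turn a score into a suggestion; thresholds are illustrative only."""
    if score >= likely_at:
        return "likely duplicate"
    if score < unlikely_below:
        return "likely distinct"
    return "needs review"
```

For example, combined_score(1.0, 0.5, 0.0, weights=(1, 1, 2)) gives 0.375: the name mismatch is weighted twice as heavily as the other two measures, and the pair lands in the "needs review" band.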

Gazetteer “Duplicates” Processing Cycle
  [Diagram, built up over several frames. Labels in the final frame: random features, prep, grouped features, weigh, review, accepted, post, feature database, rework, suspended, reject, trash]
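
Only the labels of the processing-cycle diagram survive here, not its arrows, so the flow has to be guessed. The sketch below encodes one plausible reading as a small state machine: features are prepped into groups, weighed, and then reviewed, with the review ending in accept, suspend, or reject. Every transition in the table is an assumption, in particular that suspended groups are reworked back into the grouped state; the state and action names are hypothetical.

```python
# One possible reading of the diagram labels; the arrows are assumed.
# "review" is modeled as the cataloger choosing accept, suspend or reject.
TRANSITIONS = {
    ("random", "prep"): "grouped",
    ("grouped", "weigh"): "weighed",
    ("weighed", "accept"): "accepted",
    ("weighed", "suspend"): "suspended",
    ("weighed", "reject"): "trash",
    ("accepted", "post"): "database",
    ("suspended", "rework"): "grouped",
}


def step(state, action):
    """Apply one action; raise if the diagram reading does not allow it."""
    try:
        return TRANSITIONS[(state, action)]
    except KeyError:
        raise ValueError(f"cannot {action!r} from state {state!r}") from None


def run(state, actions):
    """Apply a sequence of actions and return the final state."""
    for action in actions:
        state = step(state, action)
    return state
```

Under this reading, run("random", ["prep", "weigh", "accept", "post"]) ends in "database", and a suspended group can go around the loop again through rework.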

[end]