Lecture 10 Creating and Maintaining Geographic Databases

Lecture 10 Creating and Maintaining Geographic Databases

The Role of Error Map and attribute data errors are the data producer's responsibility, GIS user must understand error. Accuracy and precision of map and attribute data in a GIS affect all other operations, especially when maps are compared across scales.

Accuracy closeness to TRUE values results, computations, or estimates compromise on infinite complexity generalization of the real world difficult to identify a TRUE value e.g., accuracy of a contour

Does not exist in real world Compare to other sources Accuracy (cont.) accuracy of the database = accuracy of the products computed from database e.g., accuracy of a slope, aspect, or watershed computed from a DEM

Positional Accuracy typical UTM coordinate pair might be: Easting 579124.349 m Northing 5194732.247 m If the database was digitized from a 1:24,000 map sheet, the last four digits in each coordinate (units, tenths, hundredths, thousandths) would be questionable

Testing Positional Accuracy Use an independent source of higher accuracy: find a larger scale map (cartographically speaking) use GPS Use internal evidence: digitized polygons that are unclosed,

lines that overshoot or undershoot nodes, etc. are indications of inaccuracy sizes of gaps, overshoots, etc. may be a measure of positional accuracy Precision not the same as accuracy! repeatability vs. truth not closeness of results, but number of

decimal places or significant digits in a measurement A GIS works at high precision, usually much higher than the accuracy of the data themselves Accuracy vs. Precision High Accuracy

Low Precision Darts are near the bullseye (the "true value"), but there aren't very many clusters of them (not reproduceable). Low Accuracy High Precision Many darts in

reproduceable clusters, but not in the bullseye. Accuracy vs. Precision High Accuracy Low Precision Darts are near the bullseye (the "true value"), but there

aren't very many clusters of them (not reproduceable). Low Accuracy High Precision Many darts in reproduceable clusters, but not in the bullseye.

Components of Data Quality positional accuracy attribute accuracy logical consistency completeness lineage Lecture 10 Geographic Databases

Gateway to Spatial Analysis Chapter 10 up to 10.4, Longley et al. Definitions Database an integrated set of attributes on a particular subject Geographic (=geospatial) database set of attributes on a particular subject for a particular

geographic area Database Management System (DBMS) software to create, maintain and access databases A GIS can answer the question: What is where? WHAT: Characteristics of features (= attributes).

WHERE: In geographic space. A GIS links attribute and spatial data Attribute Data Flat File or DBMS Relationships Topology Table

Map Data Point File Line File Area File Topology Type Flat File or DBMS Attribute

Attribute Attribute Record Value

Value Value Record Value Value

Value Record Value Value

Value Types of DBMS Models Hierarchical Network Relational - RDBMS Object-oriented - OODBMS Object-relational - ORDBMS

Historically, databases were structured hierarchically in flat files... Relational Databases rule now 2/1/98 2/4/98

Role of DBMS System Task Geographic Information System

Data loading Editing

Visualization Mapping Analysis Database Management System

Storage Indexing Security Query

Data Programmable API Relational DBMS (1) Data stored as tuples (tup-el), conceptualized as tables Table data about a class of objects

Two-dimensional list (array) Rows = objects Columns = object states (properties, attributes) Table Row = object Vector feature

Column = attribute Relational DBMS (2) Most popular type of DBMS Over 95% of data in DBMS is in RDBMS Commercial systems Microsoft Access

Microsoft SQL Server Oracle IBM DB2 Informix Sybase Relational Join Fundamental query operation Occurs because

Data created/maintained by different users, but integration needed for queries Table joins use common keys (column values) Table (attribute) join concept has been extended to geographic case Relational Databases

2/1/98 2/4/98 Parts of GIS database tables for U.S states (A) STATES table; (B) POPULATION table Parts of GIS database tables for U.S states (C) joined tableCOMBINED STATES and POPULATION

(C) data partially normalized into three subtables Tax assessment database (D) joined table SQL

Structured (Standard) Query Language (pronounced SEQUEL) Developed by IBM in 1970s Standard for accessing relational databases Three types of usage Stand alone queries High level programming Embedded in other applications

Types of SQL Statements Data Definition Language (DDL) Create, alter and delete data CREATE TABLE, CREATE INDEX Data Manipulation Language (DML) Retrieve and manipulate data SELECT, UPDATE, DELETE, INSERT

Data Control Languages (DCL) Control security of data GRANT, CREATE USER, DROP USER Spatial Query/Search & Retrieval: Gateway to Spatial Analysis Overlay is a spatial retrieval operation that is equivalent to an

attribute join. Buffering is a spatial retrieval around points, lines, or areas based on distance. Overlay Image courtesy of K. Foote/M. Lynch, UT-Austin

Overlay like an attribute join 2/1/98 2/4/98 Types of overlay operations Union Intersect Identity

Max Min Etc. Union computes the geometric intersection of two polygon coverages. All polygons from both coverages will be split at their intersections and preserved in the

output coverage. Union within 25 miles of a city OR within 25 miles of a major river.

Intersect computes the geometric intersection of two coverages. Only those features in the area common to both coverages will be preserved in the output coverage. Intersect within 25 miles of

a city AND within 25 miles of a major river. Identity computes the geometric intersection of two coverages. All features of the input coverage, as well as those features of the identity coverage that overlap the input coverage, are preserved in the output coverage.

Identity Portion of the major city buffer WITHIN the major river buffer within 25 miles of a city OR within 25 miles of a major river. Union within 25 miles of a city AND within 25 miles of a major river.

Intersect Intersect Identity Raster Retrieval: Map Algebra Raster overlay

Combinations of spatial and attribute queries can build some complex and powerful GIS operations. Input Grid A Input Grid B Compared

with Output Grid C Buffer Recode OR

And many more . See spatial analysis handout on course web site.

Recently Viewed Presentations

  • Invertebrate Zoology Lecture 7: Phylum Cnidaria, Part 2

    Invertebrate Zoology Lecture 7: Phylum Cnidaria, Part 2

    Invertebrate Zoology Lecture 7: Phylum Cnidaria, Part 2 Class Hydrozoa Class Hydrozoa Order Hydroidea: the hydroids Polyp form often colonial, branched (Obelia) Includes feeding and reproductive polyps Coenosarc: common gastrovascular cavity Class Hydrozoa Order Hydroidea: life cycle (Obelia sp.) Planula...
  • BPS States - Rutgers Physics & Astronomy

    BPS States - Rutgers Physics & Astronomy

    BPS News(Half)Hour . Gregory Moore, Rutgers University. Today I will report on the headlines . for a few projects . The common theme in all these projects is the BPS spectrum of N=2 supersymmetric field theories in four- and in...
  • Humorous Poetry and Song Lyrics by Don and

    Humorous Poetry and Song Lyrics by Don and

    The only other sound's the sweep Of easy wind and downy flake. The woods are lovely, dark and deep. But I have promises to keep And miles to go before I sleep. And miles to go before I sleep. *...
  • Blue Ribbon Run/Walk 5K - Life Science Academy

    Blue Ribbon Run/Walk 5K - Life Science Academy

    Blue Ribbon Run/Walk 5K. Jack Fuller, Carly Kirk, Rayvin Ewers, Naomi Belcher, Miranda McCormick
  • ELA with Miss Jordan

    ELA with Miss Jordan

    The short story is short because of the following things: Very short exposition. No details about the setting, only the essentials, the most necessary information. The story starts shortly before the climax. Small number of characters. No development of the...
  • Powerful Ways to Engage Students Using Google Classroom

    Powerful Ways to Engage Students Using Google Classroom

    One stop shop for main course materials. Create Announcements, Assignments, and Questions ... Primary Teacher. Secondary Teacher. Add materials such as: links, Drive files, YouTube videos and attachments ... Post to more than one class! Add topics to organize your...
  • Using the Semantic Web to Improve Knowledge of Translations

    Using the Semantic Web to Improve Knowledge of Translations

    Presentation given at the International Conference on Dublin Core and Metadata Applications 2017 in Washington DC on 26 October. It describes OCLC Research's experiment to improve the descriptions of translations, the way we learn about other cultures and other cultures...
  • Applied Epidemiologic Analysis Patricia Cohen, Ph.D. Henian Chen,

    Applied Epidemiologic Analysis Patricia Cohen, Ph.D. Henian Chen,

    Applied Epidemiologic Analysis Patricia Cohen, Ph.D. Henian Chen, M.D., Ph. D. Teaching Assistants Julie Kranick Sylvia Taylor Chelsea Morroni Judith Weissman