• Non ci sono risultati.

Support for massive spatial datasets in GRASS GIS

N/A
N/A
Protected

Academic year: 2021

Condividi "Support for massive spatial datasets in GRASS GIS"

Copied!
33
0
0

Testo completo

(1)

Support for massive

spatial datasets

in GRASS GIS

Markus Metz

(2)

Massive spatial datasets

Large is relative to

RAM disk space processing time

(3)

Massive spatial datasets

Examples

Global Administrative Areas (1.3 GB) OpenStreetMap (20 GB)

Corine Land Cover (vector version, 4.4 GB) LiDAR (open ended size)

(4)
(5)
(6)
(7)

Massive spatial datasets

Particularly problematic for

• Least cost searches • Hydrological modelling

• Image reprojection and (ortho-)rectification • Vector topology

(8)

Data processing concepts

Raster

Only few processes need to load the full raster at once

Scattered (random) access Sweep-lines

Sorting and searching

Vector

loading the full vector map to memory is probably never necessary

BUT: processing time can be long,

(9)

Data processing concepts

Raster

(10)

Data processing concepts

Raster

Scattered (random) access -> never happens Sweep-lines: e.g. hydrology, cost surfaces

(11)
(12)
(13)

Data processing concepts

Raster

Scattered (random) access -> never happens Sweep-lines: e.g. hydrology, cost surfaces → Sorting and searching

(14)

Massive spatial datasets

Solutions

NEVER load the full raster to memory

Tiling

external memory

Fast sorting and searching

Large File Support (LFS), files > 2 GB

(15)

What is GRASS GIS ?

Geographic Resources Analysis Support System

Free and Open Source since 1984

Member of

Linked to GDAL, GIS and

(16)

GRASS vector model

Vector geometry types

Point

Centroid Line

Boundary

Area (Boundaries + Centroid) Face (3D Area)

[Kernel (3D Centroid)]

[Volumes (Faces + Kernel)] All types are true 3D: x, y, z

Node Node Vertex Vertex Segm ent Segment Segm ent Node Node Centroid Centroid Vertex Vertex Vertex Vertex Vertex V e rt e x Boundary B ou nd ary B o u n d a ry Line Area

(17)

OGC Simple Features vs

Vector Topology

OGC Simple Features

points, lines, polygons

-> replicated boundaries for adjacent areas

faster computations, but extra work for maintenance

Switzerland gaps

slivers Non-topological

(18)

Vector Topology

areas are constructed from boundaries

boundaries are shared between adjacent areas

slower computations, but less (nearly no manual) maintenance

OGC Simple Features vs

Vector Topology

Switzerland Topological boundaries generalized original generalized

(19)

Data processing in GRASS GIS

Raster

Developed in the 1980's row by row default

segmented reprojection, interpolation, hydrology, ... sorted terraflow, viewshed

Vector

Mix of external memory and all in memory (topology)

network analysis LiDAR point clouds

(20)

Massive datasets in GRASS 7

Improved raster tiling

Improved external memory, raster and vector

Fast sorting and searching

(21)

Massive datasets in GRASS 7

(22)

Massive datasets in GRASS 7

Hydrology: r.watershed

(memory + processing time)

0 50 100 150 200 250 300 0.01 0.1 1 10 100 1000 10000 old r.watershed new r.watershed r.terraflow

grid size [millions of cells]

tim e [ m in ] ?

(23)

Massive datasets in GRASS 7

Cost surfaces: r.cost

4.5 million cells, 160 start points

GRASS 6.4 GRASS 7.0 0 20 40 60 80 100 120 140 tim e [s ec ]

(24)

Massive datasets in GRASS 7

Cost surfaces: r.cost

4.5 million cells, 160 start points

GRASS 6.4 GRASS 7.0 0 20 40 60 80 100 120 140 tim e [s ec ]

(25)

Massive datasets in GRASS 7

B-spline raster interpolation

(26)

Massive datasets in GRASS 7

(27)

Vector topology: new spatial index

Massive datasets in GRASS 7

(28)

Massive datasets in GRASS 7

Topological cleaning, e.g. vector polygon import

GADM, all levels, global

GRASS 6.4 GRASS 7.0 GRASS 7.0 0 20 40 60 80 100 120 0 1 2 3 4 5 6 7 timeRAM tim e [m in ] G B R A M experimental

(29)

Massive datasets in GRASS 7

(30)

Massive datasets in GRASS 7

Vector distance to nearest feature

Old:

(n features in from) x (n features in to) New:

Best case: (n features in from) Worst case: as old version

(31)

Massive datasets in GRASS 7

TODO

experimental -> stable

Further reduce memory requirements (work in progress) [Bulk of the work: General optimisation of modules]

(32)

Massive datasets in GRASS 7

Summary

Raster

Improved external memory for rasters Selected raster modules speed-up

Vector

(33)

Riferimenti

Documenti correlati

The article examines the notion of untruth in relation to photography and illustrates the function of the photograph as a stylistic and linguistic device that explains the idea of

22 Les traits de ce drame sont réunis dans la gure de la prostituée : la xation du corps de la femme sous forme de marchandise qui se prête au désir masculin pétri ant, la

La poetica della contraddizione ha accomunato questi teatranti eccezionali attra- verso l’elaborazione di un particolare linguaggio della scena dal tratto fortemente allegorico

se da quella di ‘scrittrice lesbica’ passiamo ad analizzare la categoria della ‘tematica lesbica’, si pongono ovvi problemi di esclusione, che riguar- dano scrittrici

In this study, the attention is focused on ETs present in Rubus berries, raspberries and blackberries, to verify whether they are able to protect against gastric ulcer and how

CONTENTS INTRODUCTION Fear in international relations and in European integration Paolo Caraffini and Filippo Maria Giordano ESSAYS • Changing fears in European geopolitical

Use of all other works requires consent of the right holder (author or publisher) if not exempted from copyright protection by the