
Data Science: History repeated? The heritage of the Free and Open Source GIS community


Academic year: 2021


Geophysical Research Abstracts Vol. 16, EGU2014-PREVIEW, 2014 EGU General Assembly 2014

© Author(s) 2014. CC Attribution 3.0 License.

Data Science: History repeated? – The heritage of the Free and Open Source GIS community

Peter Löwe (1) and Markus Neteler (2)

(1) Technische Informationsbibliothek TIB, Development, Hannover, Germany (peter.loewe@tib.uni-hannover.de), (2) GIS and Remote Sensing Unit, CRI-DBEM, Fondazione Edmund Mach, S. Michele all’Adige, Italy

Data Science is described as the process of knowledge extraction from large data sets by means of scientific methods. The discipline draws heavily on techniques and theories from many fields, which are jointly used to further develop information retrieval on very large structured or unstructured datasets. While the term Data Science was coined as early as 1960, the current perception places the field still in the first section of the Gartner hype cycle, well en route from the technology trigger stage to the peak of inflated expectations.

In our view, the future development of Data Science could benefit from the analysis of experiences from related evolutionary processes. One predecessor is the area of Geographic Information Systems (GIS). The intrinsic scope of GIS is the integration and storage of spatial information from often heterogeneous sources, data analysis, and the sharing of reconstructed or aggregated results in visual form or via data transfer. GIS is successfully applied to process and analyse spatially referenced content in a wide and still expanding range of scientific fields, spanning from human and social sciences such as archaeology, politics and architecture to environmental and geoscientific applications, even including planetology.

This paper presents proven patterns for innovation and organisation derived from the evolution of GIS, which can be ported to Data Science. Within the GIS landscape, three strategic interacting tiers can be distinguished: i) standardisation, ii) applications based on closed-source software, without the option of access to and analysis of the implemented algorithms, and iii) Free and Open Source Software (FOSS) based on freely accessible program code enabling analysis, education and improvement by everyone. This paper focuses on patterns gained from the synthesis of three decades of FOSS development. We identify best practices which evolved from long-term FOSS projects and describe the role of community-driven global umbrella organisations such as OSGeo, as well as the standardisation of innovative services. The main driver is the acknowledgement of a meritocratic attitude. These patterns follow evolutionary processes of establishing and maintaining a web-based democratic culture that spawns new kinds of communication and projects. This culture transcends the established compartmentation and stratification of science by creating mutual benefits for the participants, irrespective of their research interests and standing. Adopting these best practices will enable the emerging Data Science communities to avoid pitfalls and to accelerate their progress to stages of productivity.
