Loading…
This event has ended. Create your own event on Sched.
Welcome to the Earth Science Information Partners (ESIP) 2018 Summer Meeting! The 2018 theme is Realizing the Socioeconomic Value of Data. The theme is based on one of the goals in the 2015 - 2020 ESIP Strategic Plan, which provides a framework for ESIP’s activities over the next three years.

All Presentations are being added to a Google Folder temporarily and then will be moved to FigShare and linked to the sessions here. 
View analytic

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Canyon A [clear filter]
Tuesday, July 17
 

9:30am

Governance Transition for the Data Management Training Clearinghouse Expansion Project
The collaboratively developed and maintained Data Management Training Clearinghouse (DMTC) has recently been awarded a three year Institute of Museum & Library Science (IMLS) grant as part of its National Digital Library Program. The IMLS grant will enable the DMTC to expand and enhance its web presence, and services by expanding and diversifying the content included in the clearinghouse, enhancing cataloging and classification, and developing a method to enable feedback between trainers and trainees. The DMTC has been shepherded by the ESIP Data Management Training Working Group (DMT WG) which is nearing the end of its two year cycle. This session will be a working session where current DMT WG members work with potential DMTC Advisory Board members to define the governance structure and strategies for the future scope, activities, and services of the DMTC. Following this session, a business meeting will cover the accomplishments of the DMT WG over the past two years, and launch the IMLS expansion project.

Speakers & Moderators
avatar for Karl Benedict

Karl Benedict

Director of Research Data Services, University of New Mexico
For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and currently... Read More →
avatar for Nancy Hoebelheinrich

Nancy Hoebelheinrich

Principal, Knowledge Motifs LLC
See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
avatar for Erin Robinson

Erin Robinson

Executive Director, Earth Science Information Partners (ESIP)
Erin Robinson works at the intersection of community informatics, Earth science and non-profit management. Over the last 10 years, she has honed an eclectic skill set both technical and managerial, creating communities and programs with lasting impact around science, data, and technology... Read More →


Tuesday July 17, 2018 9:30am - 11:00am
Canyon A

11:30am

Seeking feedback from the ESIP community on two developing dataset maturity assessment models
Abstract:
In collaboration with the ESIP Data Stewardship Committee, the NCEI Use/Service Maturity Matrix (MM-Serv) Working Group has developed a complete draft maturity model for the services provided for individual datasets, where services include both automated (e.g. web services) and human-provided functions.

A complete draft of a WMO-Wide Stewardship Maturity Matrix for Climate Data (SMM-CD) has also been developed by the WMO SMM-CD Working Group in collaboration with international experts who participated in the Expert Meeting on Climate Data Modernisation at the Royal Netherlands Meteorological Institute (KNMI), 16–18 April 2018.

In this session, the latest draft version of both maturity assessment models will be introduced to the ESIP community, followed by a review/comment period in an effort to gain valuable feedback from the community.

Agenda:
Part I: Presentations
1) Opening remark & MMs status - Ge Peng
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854948)

2) Introducing NCEI/ESIP-DSC MM-Serv - Ruth Duerr
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854975)

3) Introducing WMO SMM-CD - Christina Lief
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854993)

Part II: Working Session
Sub-group one - reviewing MM-Serv (group leads: Ge Peng, Sophie Hou, Bob Downs)
(The MM-Serv poster is available at: https://doi.org/10.6084/m9.figshare.6855020)

Sub-group two - reviewing SMM-CD (group leads: Christina Lief, Nancy Ritchey)
(The SMM-CD poster is available at: https://doi.org/10.6084/m9.figshare.6855056)

To cite this session: Peng, G., M.J. Brewer, R. Duerr, W. Wright, and C. Lief, 2018: Seeking feedback from the ESIP community on two developing dataset maturity assessment models. Session. The Earth Science Information Partners (ESIP) 2018 Summer Meeting, 17–20 July 2018, Tucson, AZ, USA.

Speakers & Moderators
avatar for Christina Lief

Christina Lief

Physical Scientist/Consultant, WMO
Recently retired from NOAA/NCEI after a 30 year career with the US Federal Government. I am a Physical Scientist specializing in global observing system climate data (remote sensed & in-situ, data access and documentation). I am presently working as a consultant for WMO as well as... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management


Tuesday July 17, 2018 11:30am - 1:00pm
Canyon A

2:00pm

Semantics In Action
This two part session will focus on real-life use cases for ESIP's semantic technology resources, and hands-on tutorials to help ESIP members begin taking advantage of these resources. Part 1 will feature presentations from people who are using semantic resources: how they are using them and why, and what they are getting for their efforts. Part II will feature hands-on tutorials to help ESIP members use our semantic resources in their own environments to solve actual problems. Tutorials will include using schema.org for improved search rankings; using JSON LD for linked data; using COR and Bioportal for document annotation.

AGENDA:

Semantic Search in Action in ArcGIS Hub, Pranav Kulkarni, ESRI R & D Center

Esri's ArcGIS Hub has an improved search experience by implementing semantic search using a knowledge graph. The search is context-aware and provides a great user experience in search and discovery of data on ArcGIS Hub.
Pranav Kulkarni will be talking about how his team implemented semantic search at scale and how the knowledge graph can be grown with custom vocabularies further improving the search results.

Semantics in Action in the Cryosphere, Ruth Duerr The Cryospheric science and polar regions communities have a number of organizations and activities, both global and national, that are trying to pull together the observations and data needed to understand the rapid changes occurring in the Arctic.  As part of these activities, work is going on across the full range of the semantic spectrum - everything from controlled vocabularies, to glossaries, to full blown ontologies.  Ruth will discuss some of these activities, their underlying use cases, and how they tie to other semantics activities in ESIP and Earth science generally.

Using the Environment Ontology (ENVO), Pier Luigi Buttigieg, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research 

The Environment Ontology (ENVO) is a community ontology for the machine-readable representation of environmental entities. ENVO has been built along the best practices of the Open Biological and Biomedical Ontology Foundry and Library, thus reuses and aligns to a suite of existing ontologies to express environmental entities such as geographic, astronomical, and anthropogenic features as well as the processes they participate in. The ontology’s initial uses were in the life sciences, and thus focused on entities such as biomes and ecosystems. It has become a standard resource in the genomes and microbiome communities, and is steadily being adopted in other disciplines. Most recently, ENVO has seeded and interoperates with ontologies in the domains of agronomy, food science, and - in collaboration with UN Environment - the Sustainable Development Goals. It also is providing semantic expression for a number of existing and emerging standard vocabularies, extending their functionality.

 Pier Luigi will discuss typical usage scenarios for the Environment Ontology, including its recent deployment in the UNESCO/IOC-IODE Ocean Best Practice repository and an example of combining ENVO and Gene Ontology to mobilise data  in environmental genomics.





Speakers & Moderators

Tuesday July 17, 2018 2:00pm - 3:30pm
Canyon A

4:00pm

Semantics In Action
This two part session will focus on real-life use cases for ESIP's semantic technology resources, and hands-on tutorials to help ESIP members begin taking advantage of these resources. Part 1 will feature presentations from people who are using semantic resources: how they are using them and why, and what they are getting for their efforts. Part II will feature hands-on tutorials to help ESIP members use our semantic resources in their own environments to solve actual problems. Tutorials will include using schema.org for improved search rankings; using JSON LD for linked data; using COR and Bioportal for document annotation. If you're a schema.org enthusiast, or just trying to figure out whether you might be an enthusiast, you've come to the right place this year. Other sessions on schema.org include a session on publishing using schema.org Dataset (http://sched.co/EyqH), and a session on assessing the readiness of community metadata for schema.org (http://sched.co/Eypl).

AGENDA

Schema.org, Doug Fils, Consortium for Ocean Leadership & Adam Shepherd Woods Hole There is an emerging practice to leverage structured metadata to aid in the discovery of web based resources.  Much of this work is taking place in the context (no pun intended) of schema.org and has extended to the resource type Dataset.  This session will present approaches, tools and references that will aid in the understanding and development of schema.org in JSON-LD and its connection to external vocabularies.  The goal of this session is to provide you the basics to generate example documents that can be tested and validated with various tools. This will form a foundation that can be used for the development of code or delivery solutions in your systems to expose data sets and their associated structured metadata following FAIR principles and leveraging schema.org.  


Speakers & Moderators

Tuesday July 17, 2018 4:00pm - 5:30pm
Canyon A
 
Wednesday, July 18
 

4:00pm

Earth Science Data Uncertainty – White Paper Development
During the 2017 ESIP Summer Meeting, the Information Quality Cluster (IQC) sponsored a plenary panel session and a breakout session focused on Earth science data uncertainty. Expert panellists presented to the ESIP audience key aspects of scientific quality and addressed questions such as "How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty? What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to the (possibly skeptical) public?" Following considerable discussion during the breakout session, a one of the key action items recommended was that a clear understanding of the concept of uncertainty, and its communication to users was essential, and that the IQC should develop a white paper to satisfy this objective. During the 2017 ESIP Winter Meeting, the IQC held a session titled "Formulation of a White Paper on Earth Science Data Uncertainty" where further presentations by experts was held about the mathematical basis for uncertainty as well as uncertainty from the point of view of scientific data producers and applications' users, followed by subgroup discussions for formulating the white paper. An outline of the white paper has been developed and reviewed by IQC members and a number of individuals have signed up to be co-authors and/or reviewers.

The purpose of this breakout session is to continue this progress by acquiring a final set considerations and recommendations from additional domain experts spanning multiple disciplines of Earth science and data science. This will be followed by a working session to finalize the outline and establish the final points and issues that should be addressed by the paper.

The following is a list of speakers, titles and brief descriptions of the talks to be delivered by the panelists:

Michael Little – NASA AIST Program
Presentation Title: “An AIST Program View of the Significance of Uncertainty in Data Exploitation”
Presentation Summary: While uncertainty is important, everyone has a different idea about what it means, how to characterize it and what it should be used for. I hope to provide a brief overview of how I think it’s used in programmatic decisions.

Jeff Privette – NOAA/NCEI
Presentation Title: “NOAA/NCEI’s Approaches to Informing Users on Uncertainty”
Presentation Description: Determining uncertainty in environmental data is critical but typically challenging and expensive. Therefore, many scientists either ignore, guess at, or use validation studies to estimate it. This can lead to misleading results in climate monitoring, especially in derivative or time-series results such as climate anomalies or rankings. NCEI is adopting a policy to inform data users of what uncertainty, validation, and quality assurance has been applied so that users are informed prior to ordering the data.

Faozi Said – NOAA/NESDIS
Presentation Title: “Leveraging Cal/Val to Effectively Communicate the Physical Data Limitations”
Presentation Summary: Using a real world example, we explore on a high level the calibration and validation (Cal/Val) challenges we face and how the possible issues and uncertainty in the data are communicated to the end user.

Jonathan Hobbs – Jet Propulsion Laboratory
Presentation Title: “Probability as a Foundation for Data Uncertainty: Applications in Remote Sensing”
Presentation Summary: Earth science data records often include products that combine models with indirect observations of their quantity of interest. Probability serves as a foundational tool for characterizing uncertainty for these complex methods. This presentation will illustrate these ideas in the context of remote sensing retrievals.


Speakers & Moderators
ML

Mike Little

AIST Program Manager, NASA
DM

David Moroni

Data Stewardship and User Services Team Lead, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Senior Science Data Systems Engineer at the Jet Propulsion Laboratory and Data Stewardship and User Services Team Lead for the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
JP

Jeff Privette

NOAA/NCEI
avatar for Hampapuram Ramapriyan

Hampapuram Ramapriyan

Research Scientist/SME, Science Systems and Applications, Inc.
Information Quality, Data Stewardship, Provenance, Preservation Standards
FS

Faozi Said

NOAA/NESDIS


Wednesday July 18, 2018 4:00pm - 5:30pm
Canyon A
 
Thursday, July 19
 

9:30am

Preparing for the CoreTrustSeal - Insights and Lessons Learned
As part of a coalition convened by the American Geophysical Union (AGU) and also in collaboration with the Research Data Alliance, ESIP is fostering capabilities to make data Findable, Accessible, Interoperable, and Reusable (FAIR) by contributing to the Enabling FAIR Data project. Among many key contributors and stakeholders of the effort, data repositories and their capabilities are vital to enable and facilitate the FAIR principles as developed by Force11.org.

In order to understand the data repositories’ services and their maturity levels, there are several options for assessing a data repository, and one possibility that is being considered by the Enabling FAIR Data project is the CoreTrustSeal. While the CoreTrustSeal has provided guidelines and tutorials regarding the assessment process, a data repository might still have questions regarding the best approach in evaluating its services per the CoreTrustSeal requirements.

During this session, the speakers will present their experiences to advise potential or would-be CoreTrustSeal applicants on several key areas including licensing, ethical norms, funding and preservation level. The speakers will also interact with the audience to answer any additional questions or concerns that the audience might have.

Session Agenda:
  1. Introduction - Sophie Hou and Doug Schuster (National Center for Atmospheric Research)
  2. Presentations:
    • Lisa Johnston – University of Minnesota
    • Roger Weaver – Missouri University of Science and Technology
    • Bob Downs – CIESIN-SEDAC, Columbia University
  3. Discussion/Q&As
  4. Next steps

Speakers & Moderators
SH

Sophie Hou

Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
avatar for Kerstin Lehnert

Kerstin Lehnert

Lamont-Doherty Earth Observatory
Kerstin Lehnert, Chair of the EarthCube Leadership Council (December 2015 - May 2018), is Senior Research Scientist at the Lamont-Doherty Earth Observatory of Columbia University and Director of the NSF-funded data facility IEDA (Interdisciplinary Earth Data Alliance). Kerstin holds... Read More →
avatar for Roger Weaver

Roger Weaver

Scholarly Communications Librarian, Missouri University of Science and Technology


Thursday July 19, 2018 9:30am - 11:00am
Canyon A

11:30am

Earth Science Data Uncertainty – White Paper Development – Working Session
During the 2017 ESIP Summer Meeting, the Information Quality Cluster (IQC) sponsored a plenary panel session and a breakout session focused on Earth science data uncertainty. Expert panellists presented to the ESIP audience key aspects of scientific quality and addressed questions such as "How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty? What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to the (possibly skeptical) public?" Following considerable discussion during the breakout session, a one of the key action items recommended was that a clear understanding of the concept of uncertainty, and its communication to users was essential, and that the IQC should develop a white paper to satisfy this objective. During the 2017 ESIP Winter Meeting, the IQC held a session titled "Formulation of a White Paper on Earth Science Data Uncertainty" where further presentations by experts was held about the mathematical basis for uncertainty as well as uncertainty from the point of view of scientific data producers and applications' users, followed by subgroup discussions for formulating the white paper. An outline of the white paper has been developed and reviewed by IQC members and a number of individuals have signed up to be co-authors and/or reviewers.

This IQC session represents a follow-on to the preceding ESIP 2018 Summer Meeting IQC breakout session which intends to provide an opportunity to acquire a final set considerations and recommendations from additional domain experts toward the development of the white paper. This purpose of this working session is to finalize the white paper outline and establish the final points and issues that should be addressed by the paper. This session will also function as a “last call” for any additional requests for co-authorship or reviewers for the proposed paper.


Speakers & Moderators
DM

David Moroni

Data Stewardship and User Services Team Lead, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Senior Science Data Systems Engineer at the Jet Propulsion Laboratory and Data Stewardship and User Services Team Lead for the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
avatar for Hampapuram Ramapriyan

Hampapuram Ramapriyan

Research Scientist/SME, Science Systems and Applications, Inc.
Information Quality, Data Stewardship, Provenance, Preservation Standards


Thursday July 19, 2018 11:30am - 1:00pm
Canyon A
 
Friday, July 20
 

9:30am

Science Gateways in the Cloud, a Platform for Providing Modern Scientific Workflows for Reproducible Research and Collaboration
The advent and maturity of cloud computing technologies and tools have opened new avenues for addressing both Big Data and Open Science challenges to accelerate scientific discoveries. There is broad consensus that as data volumes grow rapidly, it is particularly important to reduce data movement and bring processing and computations to the data. Data providers also need to give scientists an ecosystem that includes data, tools, workflows and other end-to-end applications and services needed to perform analysis, integration, interpretation, and synthesis - all in the same environment or platform. Instead of moving data to processing systems near users, as is the tradition, one will need to bring processing, computing, analysis and visualization to data – so called data-proximate workbench capabilities, also known as server-side processing.

Cloud-based Science Gateways, through online portals and user-friendly interfaces, provide access to a range of resources that are of interest to a community of researchers, educators, and students, including datasets, tools, services, and workspaces. These offerings permit researchers to access a suite of capabilities to not only achieve reproducible science in a web-based workspace but also provide a platform for collaboration and conducting team science. In this session, speakers will present on-going efforts to develop cloud-based Science Gateways to facilitate end-to-end scientific workflows for communities of researchers, educators, and students in the geosciences.

Google Drive for this Session

Nancy Wilkins-Diehr, San Diego Supercomputing Center (15 minutes)
Title: Science Gateways Overview

Mohan Ramamurthy, Unidata (15 minutes)
Title: Unidata Science Gateway

Eric Lingerfelt, EarthCube Science Support Office (15 minutes)
Title: Towards an Earthcube Science Gateway

Julien Chastang, Unidata (15 minutes)
Title: Unidata Science Gateway JupyterHub

Discussion (with whatever time remains)

Speakers & Moderators
avatar for Julien Chastang

Julien Chastang

Software Engineer, UCAR - Unidata
Scientific software developer at UCAR-Unidata.
avatar for Eric Lingerfelt

Eric Lingerfelt

Technical Officer, EarthCube
Eric Lingerfelt is the EarthCube Science Support Office Technical officer and comes to ESSO from Oak Ridge National Laboratory in Oak Ridge, Tennessee, where he specialized in the design, development, and deployment of full stack application systems in support of multiple areas of... Read More →
avatar for Nancy Wilkins-Diehr

Nancy Wilkins-Diehr

Associate Director, San Diego Supercomputer Center
Science gateways and running



Friday July 20, 2018 9:30am - 11:00am
Canyon A