IMC 2011 - Breakout I: Data package usablilty

Return to 2011 Meeting Agenda

For intro slides, see PPT attached at bottom of page.
Also see: Gastil's notes from EIMC Birds of a feather session, attached (pdf) at bottom of page

Proposing 3 breakouts:

1. EML dataset clinic
2. Creating workflow-aware datasets
3. Report requirements and policies

1. EML dataset clinic
facilitator: Gastil [we may need a second person, perhaps Duane.]
We now have draft reports from the first iteration of the EML Congruence Checker for all site datasets submitted as of mid August. It is quite likely that some of this data inventory may not be acceptable for automated use without significant changes to the EML and/or data files. So far, only a few information managers have had detailed experience with the reports. This breakout will be an opportunity for IMs to examine their own datasets and reports in detail, and will be facilitated by a member of the DataManager Tiger Team. If attendance can be confirmed ahead of time, examples from participants' documents can be prepared. Participants will examine datasets closely, become familiar with potential problems, and possibly run the congruence checker v0.1 on datasets in real-time (pending availability of the web service). The subject of this clinic overlaps with the scheduled EIMC Birds of a Feather session entitled "Functional Requirements for the EML Congruence Checker".

2. Creating workflow-aware datasets
facilitator - Wade [TBA: Mark Servilla to represent the NIS]
It is planned that the NIS will house workflows for network synthesis projects which use site-contributed datasets as input. However, several processes are not yet clear, and our current guidelines (e.g, EML Best Practices) do not address actual dataset use. Will workflows be written with certain data formats in mind? Should workflows expect to run on future versions of the same dataset? What happens to existing workflows when the site (data contributor) re-designs the original product? Site IMs are the primary designers of these datasets. As such, they are uniquely qualified to propose a) guidelines for dataset construction with workflows in mind, and b) processes and expectations for use and continuity of datasets in synthesis projects. This breakout will summarize the probable issues that will arise with workflows, and outline likely processes and guidelines that will be necessary to address those issues. The breakout participants may consider proposing a production workshop for 2012 to produce needed documentation. This session is related to the EIMC Birds of a Feather session entitled "Using Web Tools and Methods", the plenary discussion "Community Standards and Practices", and the funded workshop scheduled for 2012, "NIS Workflow Development".
Notes: http://im.lternet.edu/sites/im.lternet.edu/files/Notes_ECC_Workflows_Bre...

3. Report requirements and policies
facilitator: Margaret [or TBA]
Site IMs have received draft reports from the first iteration of the EML Congruence Checker and a final version from v0.1 will be available at the end of 2011. Most of us have thought of the ECC as a tool for IMs to check quality of datasets for various uses, and to justify asking for funds to improve our systems. But like all technological advances, there are other ramifications. For example, who will be interested in reports, especially summaries, and why? Do reports reflect on a site's performance? How much information should be publicly distributed? Some of us have already brought up these topics with our site PIs. This breakout group will have 2 charges: 1) outline and describe the potential audiences outside the IMC for dataset reports (eg, LPIs, LTER EB, synthesis working groups, NSF), including their anticipated areas of interest, appropriate report content, and features to summarize. 2) propose a timeline, which is aligned with the NIS development timeline, for introducing audiences to the uses of dataset reports, and for the development of certain policies to be discussed with the EB. The breakout group will have the current sample report available for reference. This activity might eventually lead to guidelines for datasets similar to those the IMC developed for website design. This breakout is related to (but does not overlap with) the scheduled EIMC Birds of a Feather session entitled "Functional Requirements for the EML Congruence Checker".
Notes: http://im.lternet.edu/sites/im.lternet.edu/files/Notes_ECC_Breakout_I_po...

AttachmentSize
Introductory slides: Breakout_1_DP_usability_ECC.ppt198.5 KB
Notes: EIMC_2011_BoF_ECC.pdf84.4 KB