|
|
Line 1: |
Line 1: |
− | {{DataSystemProfile
| + | |
− | |DataSystemName=DataFed
| |
− | |DataSystemURL=http://datafedwiki.wustl.edu
| |
− | |ContactName=Rudy Husar
| |
− | |Contacte-mail=rhusar@me.wustl.edu
| |
− | |About=DataFed is Web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. DataFed is designed in accordance with the GEOSS architecture; It provides standard interfaces to heterogeneous distributed data, fosters data integration and use with processing web services and tools, and collects metadata and user-feedback on datasets. DataFed also provides standards-based data feeds to the NASA Giovanni System.
| |
− | |DataSystemHistory=The federated data system, DataFed, was in development since 2001 at Washington University, CAPITA, with grants from [http://capita.wustl.edu/capita/researchareas/NSFPropSubm/FinalReport/FinalReport_050629.pdf NSF], [http://capita.wustl.edu/capita/researchareas/NASAReason/ESEPMNASAReasonAbstr.htm NASA], [http://capita.wustl.edu/NEISGEI/main.html EPA] and [http://datafedwiki.wustl.edu/images/c/c4/EM_DataFed_FASTNET_050720.pdf RPOs]. Since 2004, DataFed served both Regulatory and Policy support to EPA. Within CAPITA, DataFed has become a scientific data analysis tool.
| |
− | |DataSystemAgencies=Washington University
| |
− | |DataSystemRef=[http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET],[http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA Exceptional Event Project], [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project Interoperability of Web Service-Based Data Access and Processing...] ESTO 2006, [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool, CATT], [http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf DataFed: Mediated Web Services for Distributed AQ Data Access and Processing] IGARS 2007, [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva Interoperable Info System of Systems for HTAP] HTAP 2007
| |
− | |DataSystemDataSets=[http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical See Dataset Catalog]
| |
− | |DataSystemParam=DataFed provides access to over [http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical 100+] distributed, air-quality relevant datasets (surface, satellite, and model) which can be explored and analyzed by tools for processing and visualization.
| |
− | |DataSystemCoverage=About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.
| |
− | |DataSystemAppHealth=No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.
| |
− | |DataSystemAppFcstReAnaly=A current NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS] uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will
| |
− | |DataSystemAppModelEval=The [http://capita.wustl.edu/NEISGEI/main.html EPA NEISGEI Project] uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an [http://capita.wustl.edu/models3eval/IMPComp/FinalReporModels3.htm evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data]. In the NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS], DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...
| |
− | |DataSystemAppCharact=DataFed was used to perform aerosol characterization for the RPO project, [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET]. DataFed is the main data source supporting the development of [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA's Exceptional Event Rule]. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule.]]
| |
− | |DataSystemAppOther=Since 2004, a major role of DataFed was to participate in [[EPA_Air_Quality_Data_Systems_and_GEOSS_Architecture| interoperability experiments for GEOSS]].
| |
− | |PrimaryDataStorage=DataFed is a mediator of data flow between providers and users. It does not primary/official data.
| |
− | |DataSystemValueConsolidation=Data consolidation from heterogeneous to homogeneous structure is performed on the fly for most datasets. Many historical datasets are cached at DataFed for fast data access and browsing.
| |
− | |DataSystemValueAccess=DataFed is a homogenizer of distributed, heterogeneous datasets through [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf data 'wrappers']. As a result all the data mediated in DataFed are accessible through international [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf standard data access services, OGC WCS and WMS]. At this time all data access services are free and offered through an open interface.
| |
− | |DataSystemValueProcess=The processing of raw data is performed by reusable web-service components, which include filtering, aggregation, and data fusion services. Data processing applications are created by chaining services using workflow software.
| |
− | |DataSystemValueVis=The visualization tools for parameter-spatial-temporal browsing are applicable for each dataset in the federated data system. The output data from the processing services are also available for mashups with other popular tools e.g. [http://datafedwiki.wustl.edu/index.php/2007-07-18_ESIP_Demo_OMI_NO2 Google Earth] and GIS software.
| |
− | |DataSystemValueDecisionSupport=DataFed has served the RPOs through the [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET] and [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)] projects. More recently DataFed supports the decisions for the Exceptional Event Rule for PM2.5 and ozone.
| |
− | |EndtoEndIntegration=Data access, processing and visualization are all performed within DataFed. Specific workflow configurations are created from the loosely coupled web services for different scientific analysis or decision-support applications. An example custom workflow is the [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)].
| |
− | |DataSystemArchInterop=Both the raw input data as well as the processed outputs are accessible through international standard interfaces. This allows the creation of [http://datafedwiki.wustl.edu/index.php/2007-07-25_IGARSS07_Barcelona loosely-coupled network applications] ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).
| |
− | |DataSystemArchToolsMethods=The data access, processing and visualization services in DataFed are all composed of reusable Web Services through both SOAP and REST protocols ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).
| |
− | |DataSystemArchSecurity=The data access and processing services are accessible through the SOAP-WSDL protocol, which is designed to pass through firewalls. At this time there are no access restrictions to these services.
| |
− | |DataSystemArchUserFeedbck=For each dataset registered in DataFed there is a "DataSpace" wiki page for the collection of dataset-relavent information, including user feedback (e.g. [[AIRNOW|AirNOW]]).
| |
− | |DataSystemArchOther=The DataFed architecture has been used as a model for demonstrating the [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva "System of Systems" aspect of GEOSS].
| |
− | }}
| |