[Apologies if you got multiple copies of this email.]


Data: Acquisition To Analysis (DATA) workshop

****Part of SenSys/BuildSys 2019****

Columbia University, New York, USA


The Data: Acquisition To Analysis (DATA) workshop aims to look broadly at interesting data from interesting sensing systems. The workshop considers problems, solutions, and results from all across the real-world data pipeline. We solicit submissions on unexpected challenges and solutions in the collection of datasets, on novel datasets of interest to the community, and on experiences and results (explicitly including negative results) in using prior datasets to develop new insights.

The workshop aims to bring together a community of application researchers and algorithm researchers in the sensing systems and building domains to promote breakthroughs from the integration of the generators and users of datasets. The workshop will foster cross-domain understanding of both application needs and data collection limitations.

The workshop seeks contributions across two major thrusts, but is open to a broad view of interesting questions around the collection, dissemination, and use of data as well as interesting datasets:

*** The collection and use of data

- Challenges and solutions in data collection, especially around security and privacy

- Expectations and norms for data collection from sensor networks, especially those that involve human factors

- Novel insights from existing datasets

- Metadata management for complex datasets

- Synthetic data, including its generation, application, and utility

- Success stories—key properties of useful datasets and how to generalize these

- Shortcomings of prior datasets—and how to address these in the future

- Position papers on policies and norms from experimental design through data management and use are explicitly welcomed.

*** New and interesting datasets, including but not limited to:

- Shopping-related sensing data

- Animal-related data or sensed data

- Anonymized or synthetic health-related data

- Indoor localization, especially unprocessed/unfiltered physical layer measurements

- Smart building, occupancy, motion data, energy, human comfort, vibration, BIM

- Vehicular, GPS, cellular, or Wi-Fi traces

- Reproductions of prior work that validate, refute, or enhance results

To enable the longevity of submitted datasets, we plan to provide a central repository where the data and information about the data can be archived for at least 5 years.

****Submission Guidelines****

Submissions may range from 1 to 5 pages in PDF format, excluding references, using the standard ACM conference template. Submissions are strongly encouraged to use only as much space as needed to clearly convey the significance of the work. We fully expect many submissions, especially datasets, to use only 1-2 pages, but wish to give ample space to those interested in fully elucidating positions on data collection and use, or insights from reproducibility efforts.

Dataset submissions should prefix paper titles with “Dataset: ” and must include a description of the dataset as well as a reasonable accompanying data sample. Once accepted, the fully described dataset must be shared in a public repository by the camera-ready deadline. Licensing issues will generally be resolved by following a procedure similar to that of CRAWDAD; special treatment, if needed, will be discussed separately with the TPC chairs.

Each accepted submission is required to have at least one author attend the workshop and present to the workshop attendees.

****Important Dates****

Submission deadline: August 2nd, 2019, 11:59:59 pm AOE

Notifications: August 12th, 2019, 11:59:59 pm AOE

Camera-ready: August 16th, 2019, 11:59:59 pm AOE

****Program Chairs****

Shijia Pan (Carnegie Mellon University, USA)

Pat Pannuto (University of California at Berkeley, USA)

Flora Salim (RMIT University, Australia)

Mikkel Baun Kjærgaard (University of Southern Denmark, Denmark)

****Steering Chairs****

Jie Gao (Stony Brook University, USA)

Pei Zhang (Carnegie Mellon University, USA)

Prabal Dutta (University of California at Berkeley, USA)

Jie Liu (Harbin Institute of Technology, China)

****Program Committee****

Jorge Ortiz (Rutgers University)

Wen Hu (University of New South Wales)

Jun Han (National University of Singapore)

Xiaoxuan Chris Lu (Oxford)

Olga Saukh (TU Graz)

Brano Kusy (CSIRO)

Wan Du (University of California, Merced)

Arun Vishwanath (IBM Research Australia)

Clayton Miller (National University of Singapore)

Salvatore Carlucci (NTNU)

Rachel Cardell-Oliver (University of Western Australia)

Zoltan Nagy (UT Austin)

Mohammad Saiedur Rahaman (RMIT University)

Fisayo Caleb Sangogboye (University of Southern Denmark)

Yongli Ren (RMIT University)

Jason Koh (University of California, San Diego)

