CRAWDAD metadata: umich/virgil/eval_data (v. 2008-03-28)
We collected the trace set through war walking, i.e., collecting Wi-Fi beacons
by walking around the neighborhoods in different cities in the United States,
for the evaluation of Virgil, an access point selection system.
[xml metadata]
Note: This metadata was prepared by the CRAWDAD team and verified by the data set (or tool) authors. We have made every effort to ensure its accuracy, but urge all users to consider the metadata and data carefully and be sure that their use in research is consistent with the nature and limitations of the data. We welcome any corrections.
CRAWDAD metadata structure[what is CRAWDAD metadata]
- [Data]
- [Tools]
- [Authors]
- [Author] Anthony J. Nicholson
- [Author] Yatin Chawathe
- [Author] Mike Chen
- [Author] Brian Noble
- [Author] David Wetherall
- [Papers]
[Traceset] umich/virgil/eval_data (v. 2008-03-28) | top |
| version | v. 2008-03-28 |
| changes | the initial version. |
| bibtex |
@MISC{umich-virgil-eval_data-2008-03-28,
author = {Anthony J. Nicholson and Yatin Chawathe and Mike Chen and Brian Noble and David Wetherall},
title = {{CRAWDAD} trace set umich/virgil/eval_data (v. 2008-03-28)},
howpublished = {Downloaded from http://crawdad.cs.dartmouth.edu/umich/virgil/eval_data},
month = mar,
year = 2008
}
|
| metadata last modified | 2008-03-31 |
| summary | We collected the trace set through war walking, i.e., collecting Wi-Fi beacons by walking around the neighborhoods in different cities in the United States, for the evaluation of Virgil, an access point selection system. |
| release date | 2008-03-28 |
| measurement start | 2005-07-18 |
| measurement end | 2005-09-16 |
| measurement purposes | Opportunistic Connectivity |
| methodology | We briefly summarize our methodology here---for full details, please refer to our paper: A.J. Nicholson et al., "Improved Access Point Selection", in proceedings of MobiSys 2006. We walked each neighborhood with a Compaq iPAQ that contained an 802.11 wireless card. Unlike for the field study data set, we did not just periodically scan for available APs and test their capabilities. The Virgil AP selection daemon periodically scanned and tested APs to locate a usable AP, but once one was found, it stuck with it until the iPAQ passed out of its radio range. As a result, users of this dataset will notice significant gaps in between scan sets. This is the time during which the device was associated with an access point. Note: due to a bug, all test results (AP frequency, signal strength, et cetera) for APs using WEP encryption were mistakenly set to 0 when Virgil wrote out the logs. This did not affect our results because none of the algorithms in the evaluation attempted to use these encrypted, inaccessible APs. We regret, however, that this data on the link-layer properties of these encrypted APs is unavailable to the user. We recommend the field study dataset for those who require such data. Also note that, unlike in the field study, the Virgil daemon caches test results for performance. Therefore, once a given AP is seen in a neighborhood trace, when it is subsequently detected the application-level tests are not re-run, but rather the cached test results written out to the log. |
| parent data | umich/virgil (v. 2008-03-28) |
| traces included | umich/virgil/eval_data/warwalk (v. 2008-03-28) |
[Trace] umich/virgil/eval_data/warwalk (v. 2008-03-28) | top |
| version | v. 2008-03-28 |
| changes | the initial version |
| bibtex |
@MISC{umich-virgil-eval_data-warwalk-2008-03-28,
author = {Anthony J. Nicholson and Yatin Chawathe and Mike Chen and Brian Noble and David Wetherall},
title = {{CRAWDAD} trace umich/virgil/eval_data/warwalk (v. 2008-03-28)},
howpublished = {Downloaded from http://crawdad.cs.dartmouth.edu/umich/virgil/eval_data/warwalk},
month = mar,
year = 2008
}
|
| metadata last modified | 2008-03-31 |
| summary | War-walking traceset collected in different cities in the United States for the evaluation of an access point selection system. |
| derived | false |
| release date | 2008-03-28 |
| measurement start | 2005-07-18 |
| measurement end | 2005-09-16 |
| configuration | For our evaluation, data was collected in five different neighborhoods, of three different cities in the United States. All timestamps in the datafiles are UTC, so the local times must be calculated accordingly. Because daylight savings time was in effect, Ann Arbor was UTC-4, Chicago UTC-5, and Seattle UTC-7. Neighborhoods: Chicago Loop (loop): the central business district. Data was collected during the day on a busy workday (Tuesday, 19 July 2005, 3:30-4:35 pm local time). Chicago, Wicker Park (wkpk): a high-density residential neighborhood northwest of downtown. Data collected on Monday, 18 July 2005, 7:40-9:13 am local time. Chicago, Evanston (evanston): a suburb and college town, north of the city limits. Data collected on Monday, 18 July 2005, 11:44 am to 3:20 pm. Downtown Seattle (seattle): the central business district. Data was collected on Wednesday, 20 July 2005, 7:18pm until 12:03am on July 21st (five hours later). Ann Arbor, Michigan: the downtown area. Friday, 16 September 2005, 9:41-10:44 am. For all three neighborhoods, we walked a roughly 1/2 square-mile (1.3 square-kilometer) area on the sidewalk (following the street grid pattern). |
| format | The evaluation data in the eval_data directory. Inside each directory,
the user will find a schema file, which describes in detail the format
and proper interpretation of the data files in each dataset.
In the eval_data directory, for each of the five neighborhoods, we provide a
scansets.<neighborhood> file. This file consists of a series of scan
sets. A scan set is defined as the test results for a given set of
APs, whose AP beacons the Virgil daemon detected when searching for a
new AP at a given physical spot.
The first line of each scan set is of the form:
SCAN_SET 3 |2005-07-21_02:19:16.808727
where the "3" denotes this is the third scan set in the neighborhood's
trace, and the remainder of the line is the time instant (in UTC) at
which the scan occured.
The remainder of each scan set consists of a series of lines, where
each line corresponds to an AP in the scan set. Each line is a series
of comma-separated values comprising the test result for the AP in
question:
struct ap_db_entry {
ssid, AP SSID
mac_addr, AP MAC address
encryption, is WEP enabled? {ON,OFF}
linkquality, link quality, x/92, from iwconfig
signallevel, signal level, -x dBm, from iwconfig
noiselevel, noise level, -x dBm, from iwconfig
channel, frequency (GHz) of the AP
dhcpsuccess, did AP grant DHCP address? (yes=1, no=0)
test_results optional test results (described below)
};
If the AP did not grant a DHCP address (dhcpsuccess==0), then the line
terminates with the dhcpsuccess parameter. Otherwise, the next item is
the round-trip-time (RTT) estimate in ms, then the bandwidth estimate
in bytes/sec. Finally, there is a sequence of tuples (port,status),
where port is a TCP port number, and status is one of {CLOSED=1,
OPEN=2, REDIRECTED=3}. Note that these constants are different than
those defined in the field study dataset. |
| sanitization | Only the SSIDs and MAC addresses have been altered. Each MAC address has been mapped from xx:xx:xx:xx:xx:xx format to a string of the form mac:<neighborhood><uniqid> where uniqid is an increasing value (starting at 0) for each neighborhood, determined by the order of appearance in the trace of a given AP. For example, if the 17th AP seen in the Wicker Park neighborhood had the MAC address 01:02:03:04:05:06, wherever this value appears in all log files, it would be replaced with the string "mac:wkpk:0016". If you are interested in the actual MAC addresses (to determine the popularity of various manufacturers' APs, for example) please contact us. We can provide such aggregate information without disclosing individual AP identities. The anonymized SSIDs are not tied to AP mac address, because many different APs often use the same mac address (linksys comes to mind). Instead we use the 32-bit MD5 hash of each SSID string. |
| parent data | umich/virgil/eval_data (v. 2008-03-28) |
[Author] Anthony J. Nicholson | top |
| tonynich@eecs.umich.edu | |
| institution | University of Michigan, Ann Arbor |
| department | Department of Electrical Engineering and Computer Science |
| position | Ph.D. student |
| address | Software Systems Laboratory, 2260 Hayward Ann Arbor, MI 48109-2121, USA |
| web site | http://www.eecs.umich.edu/~tonynich/ |
| related data/tools | umich/virgil (v. 2008-03-28) |
[Author] Yatin Chawathe | top |
| institution | Intel Research Seattle |
| position | Researcher |
| related data/tools | umich/virgil (v. 2008-03-28) |
[Author] Mike Chen | top |
| mike@ludic-labs.com | |
| institution | Ludic Labs |
| web site | http://www.mikechen.com/ |
| related data/tools | umich/virgil (v. 2008-03-28) |
[Author] Brian Noble | top |
| bnoble@eecs.umich.edu | |
| institution | University of Michigan, Ann Arbor |
| department | Department of Electrical Engineering and Computer Science |
| position | Associate Professor |
| address | University of Michigan, 1301 Beal Ave -- EECS 2245, Ann Arbor, MI 48109-2122 |
| web site | http://www.eecs.umich.edu/~bnoble/ |
| related data/tools | umich/virgil (v. 2008-03-28) |
[Author] David Wetherall | top |
| djw@cs.washington.edu | |
| institution | University of Washington |
| department | Department of Computer Science and Engineering |
| position | Associate Professor |
| address | Department of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350 |
| phone | 206-616-4367 |
| fax | 206-616-3804 |
| web site | http://www.cs.washington.edu/homes/djw/ |
| related data/tools | uw/sigcomm2004 (v. 2006-10-17) umich/virgil (v. 2008-03-28) |


