Download presentation
Presentation is loading. Please wait.
Published byRandolf Tyler Modified over 9 years ago
1
GDC: Group Discovery using Co-location Traces Steve Mardenfeld Daniel Boston Susan Juan Pan Quentin Jones † Adriana Iamntichi ‡ Cristian Borcea Department of Computer Science, New Jersey Institute of Technology † Department of Information Systems, NJIT ‡ Department of Computer Science, USF
2
Physical Groups Informally: groups of people that meet face to face ◦ Formal definition: Homans’ sociology book “The Human Group” Groups can be used in social or socially aware applications ◦ Recommender systems: recommend concerts to people who go to concerts together ◦ Data forwarding in delay-tolerant ad hoc networks: give priority to members of same group as destination when selecting next hop How to detect groups automatically?
3
Group Detection Using Location Traces Users carry mobile phones and upload location to central server Server analyzes location traces to detect groups In previous work, we developed an algorithm for group/place detection ◦ Achieved 96% accuracy with low false positives Problems:Location privacy Battery power
4
4 GDC: Use Bluetooth Co-location Traces Advantages ◦ Improved location privacy ◦ Low power consumption ◦ Practicality due to Bluetooth ubiquity in mobile phones ◦ Accuracy due to Bluetooth transmission range UserSeenTime AB1:00 BA1:05 INTERNET AB1:07 A BC BC1:05 AC1:07
5
5 Challenges Attendance at a group is variable People may be merely passing near a group, not remaining part of it Group members spend different lengths of time with the group Sampling frequency and user mobility can affect data completeness Each user may have a different perspective on the same meeting
6
6 Outline GDC Algorithm User Study Results Distributed GDC Conclusions
7
7 GDC in a Nutshell Transform raw Bluetooth records into meeting records between pairs of users Discover and record all combinations of users appearing at the same meeting (user clusters) Resolve differences in user perspectives on shared clusters Select all significant clusters and output as user groups
8
8 Creating Pair-wise Meeting Records Time Stamp UserUser With 11:02:01djb38jp238 11:02:01djb38mak43 11:04:14djb38jp238 11:04:14djb38mak43 11:07:05djb38mak43 Time Stamp UserUser With 11:02:15jp238djb38 11:02:15jp238mak43 11:05:02jp238mak43 11:07:50jp238djb38 11:07:50jp238mak43 Time Stamp UserUser With 11:01:30mak43jp238 11:01:30mak43djb38 11:04:18mak43jp238 11:10:10mak43jp238 Usermak43 Time Stamp User With 11:01:30jp238 11:01:30djb38 11:02:01djb38 11:02:15jp238 11:04:14djb38 11:04:18jp238 11:05:02jp238 11:07:05djb38 11:07:50jp238 11:10:10jp238 Userdjb38 Time Stamp User With 11:01:30mak43 11:02:01jp238 11:02:01mak43 11:02:15jp238 11:04:14jp238 11:04:14mak43 11:07:05mak43 11:07:50jp238 Userjp238 Time Stamp User With 11:01:30mak43 11:02:01djb38 11:02:15djb38 11:02:15mak43 11:04:14djb38 11:04:18mak43 11:05:02mak43 11:07:50djb38 11:07:50mak43 11:10:10mak43 Usermak43 User With Start Time End Time jp23811:01:3011:10:10 djb3811:01:3011:07:05 Userdjb38 User With Start Time End Time jp23811:02:0111:07:50 mak4311:01:3011:07:05 Userjp238 User With Start Time End Time mak4311:01:3011:10:10 djb3811:02:0111:07:05 Usermak43 User With Start Time End Time jp23811:01:3011:04:18 jp23811:07:5011:10:10 djb3811:01:3011:04:14 Userdjb38 User With Start Time End Time jp23811:02:0111:04:14 mak4311:01:3011:04:14 Userjp238 User With Start Time End Time mak4311:01:3011:05:02 mak4311:07:5011:10:10 djb3811:02:0111:04:14 Decreasing Meeting Granularity (MG) from 5 min to 2 ½ min produces noticeable changes
9
9 Creating User Clusters Usermak43 User With Start Time End Time jp23811:01:3011:10:10 djb3811:01:3011:07:05 Userdjb38 User With Start Time End Time jp23811:02:0111:07:50 mak4311:01:3011:07:05 Userjp238 User With Start Time End Time mak4311:01:3011:10:10 djb3811:02:0111:07:05 Usermak43 Users WithTime Spent jp238, djb3800:05:35 jp23800:08:40 djb3800:05:35 Userdjb38 Users WithTime Spent jp238, mak4300:05:04 jp23800:05:49 mak4300:05:35 Userjp238 Users WithTime Spent djb38, mak4300:05:04 djb3800:05:04 mak4300:08:40
10
10 Creating Global Clusters Resolve Perspective Differences ◦ Use Minimum Group Time (MGT) ◦ Use Minimum Group Meeting Frequency (MGMF) Usermak43 Users WithTime Spent jp238, djb3800:05:35 jp23800:08:40 djb3800:05:35 Userdjb38 Users WithTime Spent jp238, mak4300:05:04 jp23800:05:49 mak4300:05:35 Userjp238 Users WithTime Spent djb38, mak4300:05:04 djb3800:05:04 mak4300:08:40 ClusterMinimum TimeMin. Frequency djb38, jp238, mak4300:05:041 djb38, mak4300:05:351 djb38, jp23800:05:041 jp238, mak4300:08:401
11
11 Selecting the User Groups Identify and remove subgroups of significant groups ◦ Keep a subgroup if it meets double the time of the group that includes it ClusterMinimum Time djb38, jp238, mak4300:05:04 djb38, mak4300:05:35 jp238, mak4300:10:40 GroupMin. Time djb38, jp238, mak43 00:05:04 jp238, mak4300:10:40
12
12 Complexity Analysis R - total number of Bluetooth records N - total number of users in the dataset L - maximum number of users in a group ◦ Small value because relatively few users are in the transmission range (10m) ◦ Our experiments: max = 15, avg = 6.8 Creating Pair-Wise Meeting RecordsO(R) Creating User ClustersO(R * 2 L ) Creating Global Clusters O(N * 2 L ) Selecting the User GroupsO(R * 2 L ) Total ComplexityO(R * 2 L ), R>> N
13
13 Evaluation Goals ◦ Analyze effect of group meeting frequency and time ◦ Compare GDC and K-Clique K-Clique uses a time threshold to select graph edges and analyzes the graph for k-cliques Experiments ◦ Collect data from mobile phones carried by 100+ volunteer students on campus for one month ◦ Run GDC and K-Clique on collected data Also tested on Reality Mining data from MIT ◦ Ask users to rank groups using Likert Scale 1 to 5, 5 is best
14
14 Data Collection Details 78 users each contributed less than 24 hours of recorded data Sparse data: random volunteers, many students are commuters Demographics: 72% male, 28% female, 25% graduate, 75% undergraduate
15
15 Effect of Meeting Time and Frequency Detection accuracy increases significantly with meeting frequency and total meeting time
16
16 GDC vs. K-Clique Overall, GDC groups rated 30% better than the popular K-Clique algorithm ◦ GDC groups are guaranteed to meet ◦ Not all K-Clique groups meet Some GDC groups are rated poorly because members don’t know their names GDC: MGT = 2000s MGMF = 2 K-Clique: Threshold 2000s
17
17 GDC Groups: NJIT Dataset vs. Reality Mining Dataset Group distributions as a function of size are relatively similar despite the fact that Reality Mining is a denser dataset NJIT: MGT = 2000s, MGMF = 1 Reality Mining: MGT = 18000s, MGMF = 9 (normalized for 9 months)
18
18 Outline GDC Algorithm User Study Results Distributed GDC Conclusions
19
19 Distributed GDC (D-GDC) GDC executed on the phones Benefits ◦ Better privacy Avoid “Big Brother” scenario Ability to control message exchange on a per-case basis ◦ Resiliency: no bottleneck & no single point of failure ◦ Flexibility: each user controls how often to run D-GDC
20
20 D-GDC Implementation Collect Bluetooth records locally through message exchange ◦ No global aggregation like in GDC Control exchange with heuristic policies ◦ These policies can be specified by users ◦ Allows greater individual privacy control Run remainder of GDC device-local Evaluated using replay simulation over our real traces
21
Preliminary Results Overall similarity: compute similarity of each user’s GDC groups against the closest matches in D-GDC and average the results Compared D-GDC with a version running only on data collected locally by phones ◦ D-GDC performs significantly better than local-only version D-GDCLocal only Average similarity77.33%58.24% Groups with similarity > 90%59.77%19.14% 21
22
22 Conclusion Physical groups enable new socially-aware features in applications GDC: practical, high-accuracy, no location collection ◦ Validated by users and outperforms K-Clique by 30% ◦ Higher accuracy can be achieved by increasing frequency and time parameters A decentralized version improves privacy and produces promising results
23
23 Thank You! Mobius project: http://www.cs.njit.edu/~borcea/mobius/ http://www.cs.njit.edu/~borcea/mobius/ Acknowledgement: NSF grants CNS-0831753 and CNS-0834585
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.