li iwqos 2019 slides

Optimal Mobile Users for Long-term Environmental Monitoring by Crowdsourcing Juan Li, Jie Wu, and Yanmin Zhu Shanghai Ji...

0 downloads 89 Views 2MB Size
Optimal Mobile Users for Long-term Environmental Monitoring by Crowdsourcing Juan Li, Jie Wu, and Yanmin Zhu Shanghai Jiao Tong University Temple University

1

Road Map l Introduction l Model

and Formulation

l Observation

and Idea

l Algorithms l Experiments l Conclusion

2

Background

l High deployment and maintenance cost l Monitoring is coarse-grained

Pre-deployed sensor network

Air pollution map

l Cheap; l Fine-grained monitoring

Crowdsourcing over mobile devices

3

Problem The air pollution map is updated in each time slot How to get accurate monitoring maps under the limited budget?

l The average payment should be below ๐ต"#$ . Some grids do not have measurements l The payment in each slot cannot exceed ๐›ฝ. because of the budget limitation.

4

Road Map l Introduction l Model

and Formulation

l Observation

and Idea

l Algorithms l Experiments l Conclusion

5

Model and Formulation Crowdsourcer At which grid, cost

Sensing data

Recruit or not

Payment

Data Collection App

Mobile user Storage

Sensors

Mobile users arriving online t

โ€ฆ Slot 0

Update map

Slot 1

Update map

Slot T-1

Update map

6

Objective in a slot -Data Utility Importance level

๏ผŸ

๏ผŸ

2

3

๏ผŸ

2

4

4

๏ผŸ

3

1

3

Initial distribution

Gaussian Process

More accurate distribution

Collects pollution levels of grids in set A

3

l Measurements at important grids can be collected. l The crowdsourcer is sure of inferences of other grids.

Air pollution levels of all girds in the universal set U~N(๐œ‡, โƒ— ฮฃ)(known)

Air pollution levels of girds in set B=U\A ~N(๐œ‡โƒ—- , ฮฃ- )

7

Gaussian Process

๐‘ฆAir pollution levels of girds in the universal set U: ๐‘ฆโƒ— = ๐‘ฆ0 ๐‘ฆ- is known

๐‘ฆ๐‘ข ฮฃ ,ฮฃ ~N( - , -- -0 )(known) ๐‘ฆ0 ฮฃ0- , ฮฃ00 ๐‘ข0 Initial entropy of ๐‘ฆ0 :

H ๐‘ฆ0

1 = ln[ 2๐œ‹๐‘’ 2

0

67 ๐‘ข0|- = ๐‘ข0 + ๐›ด 0- ๐›ด -(๐‘ฆ- โˆ’ ๐‘ข- ) ฮฃ0|- = ๐›ด 00 โˆ’ ๐›ด 0- ๐›ด -67 ๐›ด -0

Conditional entropy of ๐‘ฆ0 |๐‘ฆ- :

|ฮฃ00 |]

H ๐‘ฆ0 |๐‘ฆ-

1 = ln[ 2๐œ‹๐‘’ 2

0

|ฮฃ0|- |]

Data utility in a slot= entropy decrease of unknown grids + sum of importance levels of known grids

8

Problem Formulation Max S.t.

7 C lim ฮฃ (data utility in slot t) C C6FG 7

The total payment in each slot โ‰ค ๐›ฝ The average of the total payment in each slot โ‰ค ๐ต"#$ Two challenges: l Online problem l NP-hard problem

9

Road Map l Introduction l Model

and Formulation

l Algorithm

Design

l Experiments l Conclusion

10

Decoupling the Long-term Problem The virtual queue length Q(t) represents the over used budget in the past

Lyapunov optimization

l Q(t+1)=Q(t) +max[the total payment in slot t-๐ต"#$ ,0]

l ฮ”Q t = E

7 M

M

7

Q ๐‘ก โˆ’ ๐‘„M ๐‘ก โˆ’ 1 ๐‘„ ๐‘ก โˆ’ 1 M

measures the increase of

the queue length between two slots.

11

Decoupling the Long-term Problem The long-term problem Max the average data utility S.t. The upper bound of the payment in each slot The average budget constraint

The short-term problem in slot t Max ๐‘ฎ๐’• =V *data utility in slot t- ฮ”Q t S.t. The upper bound of the payment in slot t is ๐›ฝ

Theoretical performance: l The average budget constraint can be satisfied as long as the time is long enough. l If ๐บS โ‰ฅ ๐‘’๐บSโˆ— ( e is the competitive ratio for the one-slot problem), 1 a small constant C

lim ฮฃ7 ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ข๐‘ก๐‘–๐‘™๐‘–๐‘ก๐‘ฆ ๐‘–๐‘› ๐‘ ๐‘™๐‘œ๐‘ก ๐‘ก

๐‘‡ C6FG โ‰ฅ ๐‘’ร— ๐‘กโ„Ž๐‘’ ๐‘œ๐‘๐‘ก๐‘–๐‘š๐‘Ž๐‘™ ๐‘Ž๐‘ฃ๐‘’๐‘Ÿ๐‘Ž๐‘”๐‘’ ๐‘ข๐‘ก๐‘–๐‘™๐‘–๐‘ก๐‘ฆ โˆ’ ๐ท/๐‘‰

12

Online Mobile User Selection Algorithm in a slot l When a mobile user comes, the crowdsourcer recruits the user if l The marginal contribution/the cost โ‰ฅ a threshold l the remaining budget can afford the cost l The threshold is updated at the end of each stage i. l Using the users arriving before and ๐ตh as input, we can obtain an offline objective ๐บh by a greedy strategy (greedy metric is contribution/cost of user) l The threshold is updated as ๐บh /(6๐ตh )

13

Road Map l Introduction l Model

and Formulation

l Algorithm

Design

l Experiments l Conclusion

14

Experiments l

Data sets: the air pollution data in Beijing; the real human trajectory data.

l

Baselines: Cost First, Shortsighted UPR and Shortsighted AVG

l

Settings: the number of slots is 2800; l the weight V is 10; l the upper bound of the one-slot payment is $700; l the average budget is $500; l the area of one grid is 5km*5km; l the importance levels are {1,2,3,4,5}; l the cost of each mobile user is form $0.2 to $1.5. l

15

Experiments

16

16

Experiments

17

17

Road Map l Introduction l Model

and Formulation

l Algorithm

Design

l Experiments l Conclusion

18

Conclusion l In this paper, we have studied the data utility maximization problem under the average budget constraint in environmental monitoring. l We come up with a novel data utility model to measure how good a set of data is. l We further design an online algorithm to solve the long-term problem.

19