Compositionality

Multi-word Compositionality Problem Definition Background Previous Approaches My Observations Summary References ...

0 downloads 76 Views 321KB Size
Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model Siva Reddy Artificial Intelligence Group Department of Computer Science University of York

Research Student Seminar Jan 24 2011

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Outline

1

Multi-word

2

Compositionality

3

Problem Definition

4

Background

5

Previous Approaches

6

My Observations

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Multi-word

A sequence of two or more words describing a meaning together. Compound Nouns credit card leather jacket

Phrasal Verbs look up get over

Idiomatic expressions kick the bucket spill the beans

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Compositionality

Given meanings of Think Water Tank Can we interpret the meanings of Think Tank Water Tank

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Compositionality

Given meanings of Think Water Tank Can we interpret the meanings of Think Tank Water Tank

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Compositionality

A multi-word “A B” is compositional if meaning(A B) = meaning(A) + meaning(B) e.g. Water Tank e.g. Post Man Non-Compositional Think Tank Smoking Gun Apple Polisher

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Compositionality

A multi-word “A B” is compositional if meaning(A B) = meaning(A) + meaning(B) e.g. Water Tank e.g. Post Man Non-Compositional Think Tank Smoking Gun Apple Polisher

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Problem Definition

Given: Large Web text of a language English: 15 billion word corpora German: 1 billion word corpora Telugu: 10 million word corpora Goal: Identify compositional and non-compositional multi-words. My focus is on compound nouns A sequence of nouns is treated as a multi-word

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Problem Definition

Given: Large Web text of a language English: 15 billion word corpora German: 1 billion word corpora Telugu: 10 million word corpora Goal: Identify compositional and non-compositional multi-words. My focus is on compound nouns A sequence of nouns is treated as a multi-word

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Importance

Dictionary Building A good dictionary does not miss non-compositional multi-words Machine Translation Non-compositional words should be treated as a single word goose egg 6= Gänseei goose egg → unwichtig Word Tokenization Search engines

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Background: Foundations of Semantics

Distributional Hypothesis (Harris, 1954) Words that occur in similar contexts tend to have similar meanings e.g. Tree and Plant, Tea and Coffee, Bus and Vehicle Bag of words hypothesis: Two documents tend to be similar if they have same distribution of similar words You shall know a word by the company it keeps (Firth, 1957)

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Vector Space Models (VSMs) of Semantics

Interpret semantics using VSM Backbone: Distributional Hypothesis

Text entity (we are interested in) as a Vector (point) in dimensional space. Existing methods representation term-document term-context

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Term-Document: (Salton et al., 1975)

1

d1: Human machine interface for Lab ABC computer applications

1

Image courtesy: (Landauer et al., 1998)

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Term-Document: (Salton et al., 1975)

2

Document similarity can be found using Cosine similarity D1.D2 sim(D1, D2) = kD1 kkD2k

2

Image courtesy: (Salton et al., 1975)

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Term-Document: (Salton et al., 1975)

2

Document similarity can be found using Cosine similarity D1.D2 sim(D1, D2) = kD1 kkD2k

2

Image courtesy: (Salton et al., 1975)

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Term-Context: Word Space Model

Words are represented as a vector build from context words I rent a house. I bought an apartment. I booked a room. Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Previous Approaches: Lin (1999)

Water Tank substituting thesaurus entries of water Aqua Tank Liquid Tank Petrol Tank Think Tank Cognition Tank Fails Drink Tank 15.7 % accuracy reported

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Previous Approaches: Lin (1999)

Water Tank substituting thesaurus entries of water Aqua Tank Liquid Tank Petrol Tank Think Tank Cognition Tank Fails Drink Tank 15.7 % accuracy reported

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Baldwin et al. (2003) Build distributional vectors of WaterTank ThinkTank Tank Measure sim(WaterTank, Tank) and sim(ThinkTank, Tank) if sim > thrsh: multi-word is compositional else: multi-word is non-compositional Pitfalls Was able to capture type-of relations only Negative examples: agony aunt

Skewed nature of senses Threshold highly varies River Bank is not similar to Bank

Moderate results: 51 % accuracy Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Katz and Giesbrecht (2006)

Build distributional vectors of WaterTank ThinkTank Water, Think, Tank

Compositional Vector: WaterTank_comp = Water + Tank sim (WaterTank, WaterTank_comp) if sim > thrsh: multi-word is compositional else: multi-word is non-compositional

Pitfalls: Threshold highly varies 48 % accuracy

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

All my observations are based on UKWaC corpus (UK Web as Corpus) Traffic: 102847 examples Light: 248029 examples TrafficLight: 1070 examples sim(TrafficLight, Traffic)= 0.577 sim(TrafficLight, Light)= 0.282 sim(TrafficLight, TrafficLight_comp)= 0.579

Context Sensitive Vectors TrafficLight : 42160 examples LightTraffic : 73544 example sim(TrafficLight, TrafficLight LightTraffic _comp)= 0.705

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

All my observations are based on UKWaC corpus (UK Web as Corpus) Traffic: 102847 examples Light: 248029 examples TrafficLight: 1070 examples sim(TrafficLight, Traffic)= 0.577 sim(TrafficLight, Light)= 0.282 sim(TrafficLight, TrafficLight_comp)= 0.579

Context Sensitive Vectors TrafficLight : 42160 examples LightTraffic : 73544 example sim(TrafficLight, TrafficLight LightTraffic _comp)= 0.705

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

All my observations are based on UKWaC corpus (UK Web as Corpus) Traffic: 102847 examples Light: 248029 examples TrafficLight: 1070 examples sim(TrafficLight, Traffic)= 0.577 sim(TrafficLight, Light)= 0.282 sim(TrafficLight, TrafficLight_comp)= 0.579

Context Sensitive Vectors TrafficLight : 42160 examples LightTraffic : 73544 example sim(TrafficLight, TrafficLight LightTraffic _comp)= 0.705

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

All my observations are based on UKWaC corpus (UK Web as Corpus) Traffic: 102847 examples Light: 248029 examples TrafficLight: 1070 examples sim(TrafficLight, Traffic)= 0.577 sim(TrafficLight, Light)= 0.282 sim(TrafficLight, TrafficLight_comp)= 0.579

Context Sensitive Vectors TrafficLight : 42160 examples LightTraffic : 73544 example sim(TrafficLight, TrafficLight LightTraffic _comp)= 0.705

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

Threshold is relative High: 495187 examples Jump: 27899 examples HighJump: 1158 sim(HighJump, High)= 0.120 sim(HighJump, Jump)= 0.333 sim(HighJump, HighJump_comp)= 0.333 Is HighJump compositional?

Context Sensitive Vectors HighJump : 84935 examples JumpHigh : 3093 examples sim(HighJump, HighJump JumpHigh _comp)= 0.481

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

Threshold is relative High: 495187 examples Jump: 27899 examples HighJump: 1158 sim(HighJump, High)= 0.120 sim(HighJump, Jump)= 0.333 sim(HighJump, HighJump_comp)= 0.333 Is HighJump compositional?

Context Sensitive Vectors HighJump : 84935 examples JumpHigh : 3093 examples sim(HighJump, HighJump JumpHigh _comp)= 0.481

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

My Observations

Threshold is relative High: 495187 examples Jump: 27899 examples HighJump: 1158 sim(HighJump, High)= 0.120 sim(HighJump, Jump)= 0.333 sim(HighJump, HighJump_comp)= 0.333 Is HighJump compositional?

Context Sensitive Vectors HighJump : 84935 examples JumpHigh : 3093 examples sim(HighJump, HighJump JumpHigh _comp)= 0.481

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Summary

Compositionality detection is tough Vector Space Model for Compositionality Detection Role of context Sensitive vectors Relative thresholds

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Suggestions/Questions? Thank You

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Bibliography I Baldwin, T., Bannard, C., Tanaka, T., and Widdows, D. (2003). An empirical model of multiword expression decomposability. In Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18, pages 89–96, Morristown, NJ, USA. Association for Computational Linguistics. Firth, J. R. (1957). A synopsis of linguistic theory 1930-55. 1952-59:1–32. Harris, Z. (1954). Distributional structure. Word, 10(23):146–162. Katz, G. and Giesbrecht, E. (2006). Automatic identification of non-compositional multi-word expressions using latent semantic analysis. In Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, MWE ’06, pages 12–19, Morristown, NJ, USA. Association for Computational Linguistics. Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model

Multi-word

Compositionality

Problem Definition

Background

Previous Approaches

My Observations

Summary

References

Bibliography II

Landauer, T. K., Foltz, P. W., and Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25:259–284. Lin, D. (1999). Automatic identification of non-compositional phrases. In Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, ACL ’99, pages 317–324, Stroudsburg, PA, USA. Association for Computational Linguistics. Salton, G., Wong, A., and Yang, C. S. (1975). A vector space model for automatic indexing. Commun. ACM, 18:613–620.

Siva Reddy (UoY)

Think Tank vs Water Tank: Compositionality Detection using Vector Space Model