Treffer: Investigating the Extent to which Distributional Semantic Models Capture a Broad Range of Semantic Relations.
Original Publication: Norwood, N. J., Ablex Pub. Corp.
Agirre, E., Alfonseca, E., Hall, K., Kravalova, J., Pasca, M., & Soroa, A. (2009). A study on similarity and relatedness using distributional and wordnet-based approaches. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL ’09, Boulder, CO (pp. 19-27).
Andrews, M., Vigliocco, G., & Vinson, D. P. (2009). Integrating experiential and distributional data to learn semantic representations. Psychological Review, 116(3), 463-498.
Apfelbaum, K. S., Goodwin, C., Blomquist, C., & McMurray, B. (2023). The development of lexical competition in written-and spoken-word recognition. Quarterly Journal of Experimental Psychology, 76(1), 196-219.
Asr, F. T., Zinkov, R., & Jones, M. (2018). Querying word embeddings for similarity and relatedness. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1: Long Papers, New Orleans, LA (pp. 675-684).
Baroni, M., & Lenci, A. (2010). Distributional memory: A general framework for corpus-based semantics. Computational Linguistics, 36, 673-721.
Baroni, M., Dinu, G., & Kruszewski, G. (2014). Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics; Volume 1: Long Papers, Baltimore, MD (pp. 238-247).
Bommasani, R., Davis, K., & Cardie, C. (2020). Interpreting pretrained contextualized representations via reductions to static embeddings. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online (pp. 4758-4781).
Brysbaert, M., Warriner, A. B., & Kuperman, V. (2013). Concreteness ratings for 40 thousand generally known English word lemmas. Behavior Research Methods, 46, 904-911.
Buchanan, E. M., Valentine, K. D., & Maxwell, N. P. (2019). English semantic feature production norms: An extended database of 4436 concepts. Behavior Research Methods, 51, 1849-1863.
Bullinaria, J., & Levy, J. P. (2007). Extracting semantic representations from word co-occurrence statistics: A computational study. Behavior Research Methods, 39, 510-526.
Carlson, G. N., & Tanenhaus, M. K. (1988). Thematic roles and language comprehension. Syntax and Semantics, 21, 263-288.
Chronis, G., & Erk, K. (2020). When is a bishop not like a rook? When it's like a rabbi! Multi-prototype BERT embeddings for estimating semantic relationships. Proceedings of the 24th Conference on Computational Natural Language Learning, Online (pp. 227-244).
Collins, A. M, & Quillian, M. R. (1969). Retrieval time from semantic memory. Journal of Verbal Learning and Verbal Behavior, 8(2), 240-247.
Connell, L., & Lynott, D. (2014). I see/hear what you mean: Semantic activation in visual word recognition depends on perceptual attention. Journal of Experimental Psychology: General, 143(2), 527.
De Deyne, S., Navarro, D. J., & Storms, G. (2013). Better explanations of lexical and semantic cognition using networks derived from continued rather than single-word associations. Behavior Research Methods, 45, 480-498.
De Deyne, S., Navarro, D. J., Perfors, A., Brysbaert, M., & Storms, G. (2019). The “Small World of Words” English word association norms for over 12,000 cue words. Behavior Research Methods, 51, 987-1006.
Dennis, S. (2005). A memory-based theory of verbal cognition. Cognitive Science, 29, 145-193.
Desroches, A. S., Friesen, D. C., Teles, M., Korade, C. A., & Forest, E. W. (2022). The dynamics of spoken word recognition in bilinguals. Bilingualism: Language and Cognition, 25(4), 705-710.
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019) BERT: Pre-training of deep bidrectional transformere for language understanding. arXiv, 1810.04805.
Dresang, H. C., Dickey, M. W., & Warren, T. C. (2019). Semantic memory for objects, actions, and events: A novel test of event-related conceptual semantic knowledge. Cognitive Neuropsychology, 36, 313-335.
Erk, K., Pado, S., & Pado, U. (2010). A flexible, corpus-driven model of regular and inverse selectional preferences. Computational Linguistics, 36, 723-763.
Estes, Z., Golonka, S., & Jones, L. L. (2011). Thematic thinking: The apprehension and consequences of thematic relations. Psychology of Learning and Motivation: Advances in Research and Theory, 54, 249-294.
Ethayarajh, K. (2019). How contextual are contextualized word representations? Comparing the geometry of BERT, ELMo, and GPT-2 embeddings. Proceedings of the 29th Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China (pp. 55-65).
Ettinger, A. (2020). What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models. Transactions of the Association for Computational Linguistics, 8, 34-48.
Hill, F., Reichart, R., & Korhonen, A. (2015). Simlex-999: Evaluating semantic models with genuine similarity estimation. Computational Linguistics, 41, 665-695.
Fernandino, L., Tong, J. Q., Conant, L. L., Humphries, C. J., & Binder, J. R. (2022). Decoding the information structure underlying the neural representation of concepts. Proceedings of the National Academy of Sciences, 119(6), e2108091119.
Ferretti, T. R., McRae, K., & Hatherell, A. (2001). Integrating verbs, situation schemas, and thematic role concepts. Journal of Memory and Language, 44, 516-547.
Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., & Ruppin, E. (2002). Placing search in context: The concept revisited. Proceedings of the 10th International Conference on World Wide Web, Hong Kong, Hong Kong (pp. 406-414).
Gentner, D., & Boroditsky, L. (2001). Individuation, relativity, and early word learning. In Language Acquisition and Conceptual Devlopment, M. Bowerman and S. C. Levinson ed., Cambridge University Press, Cambridge, UK.
Gerz, D., Vuli, I., Hill, F., Reichart, R., & Korhonen, A. (2016). SimVerb-3500: A large-scale evaluation set of verb similarity. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX (pp. 2173-2182).
Ghosh, V. E., & Gilboa, A. (2014). What is a memory schema? A historical perspective on current neuroscience literature. Neuropsychologia, 53, 104-114.
Giovannone, N., & Theodore, R. M. (2021). Individual differences in lexical contributions to speech perception. Journal of Speech, Language, and Hearing Research, 64(3), 707-724.
Goh, W. D., Yap, M. J., & Chee, Q. W. (2020). The Auditory English Lexicon Project: A multi-talker, multi-region psycholinguistic database of 10,170 spoken words and nonwords. Behavior Research Methods, 52(5), 2202-2231.
Griffiths, T. L., Steyvers, M., & Tenenbaum, J. B. (2007). Topics in semantic representation. Psychological Review, 114, 211-244.
Hare, M., Jones, M. N., Thomson, C., Kelly, S., & McRae, K. (2009). Activating event knowledge. Cognition, 111, 151-167.
Hills, T. T., Maouene, M., Maouene, J., Sheya, A., & Smith, L. (2009). Longitudinal analysis of early semantic networks: Preferential attachment or preferential acquisition? Psychological Science, 20, 729-739.
Hutchison, K. A., Balota, D. A., Neely, J. H., Cortese, M. J., Cohen-Shikora, E. R., Tse, C. S., Yap, M. J., Bengson, J. J., Niemeyer, D., & Buchanan, E. (2013). The semantic priming project. Behavior Research Methods, 45, 1099-1114.
Johns, B. T., Mewhort, D. J. K., & Jones, M. N. (2019). The role of negative information in distributional semantic learning. Cognitive Science, 43, e12730.
Johns, B. T. (2021). Distributional social semantics: Inferring word meanings from communication patterns. Cognitive Psychology, 131, 101441.
Jones, M. N., Kintsch, W., & Mewhort, D. J. K. (2006). High-dimensional semantic space accounts of priming. Journal of Memory and Language, 55, 534-552.
Jouravlev, O., & McRae, K. (2016). Thematic relatedness production norms for 100 object concepts. Behavior Research Methods, 48, 1349-1357.
Kacmajor, M., & Kelleher, J. D. (2020). Capturing and measuring semantic relations. Language Resources & Evaluation, 54, 645-682.
Kersten, A. W., & Earles, J. L. (2004). Semantic context influences memory for verbs more than memory for nouns. Memory & Cognition, 32, 198-211.
Kolde, R., Laur, S., Adler, P., & Vilo, J. (2012). Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics, 28, 573-580.
Kumar, A., Steyvers, M., & Balota, B. (2021). DA critical review of network-based and distributional approaches to semantic memory structure and processes. Topics in Cognitive Science, 14(1), 54-77.
Kwantes, P. J. (2005). Using context to build semantics. Psychonomic Bulletin & Review, 12, 703-710.
Landauer, T. K., & Dumais, S. (1997). A solution to Plato's problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104, 211-240.
Lake, B. M., & Murphy, G. L. (2021). Word meaning in minds and machines. Psychological Review, 130(2), 401-431. Advance online publication, https://doi.org/10.1037/rev0000297.
Lapesa, G., & Evert, S. (2013). Evaluating neighbor rank and distance measures as predictors of semantic priming. Proceedings of the ACL Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2013), Minneapolis, MN (pp. 66-74.
Lapesa, G., Evert, S., & Schulte im Walde, S. (2014). Contrasting syntagmatic and paradigmatic relations: Insights from distributional semantic models. Proceedings of the Third Joint Conference on Lexical and Computational Semantics (SEM 2014), Dublin, Ireland (pp. 160-170.
Lazaridou, A., Pham, N. T., & Baroni, M. (2015). Combining language and vision with a multimodal Skip-gram model. arXiv:1501.02598.
Lenci, A. (2018). Distributional models of word meaning. Annual review of Linguistics, 4, 151-171.
Lenci, A., Sahlgren, M., Jeuniaux, P., Gyllensten, A. M., & Miliani, M. (2022). A comparative evaluation and analysis of three generations of Distributional Semantic Models. Language Resources and Evaluation, 56, 1269-1313.
Levy, O., Goldberg, Y., & Dagan, I. (2015). Improving distributional similarity with lessons learned from word embeddings. Transactions of the Association of Computational Linguistics, 3, 211-225.
Lewis, M., Zettersten, M., & Lupyan, G. (2019). Distributional semantics as a source of visual knowledge. Proceedings of the National Academy of Sciences, 116(39), 19237-19238.
Liu, N. F., Gardner, M., Belinkov, Y., Peters, M. E., & Smith, N. A. (2019). Linguistic knowledge and transferability of contextual representations. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN (pp. 1073-1094.
Lund, K., & Burgess, C. (1996). Producing high-dimensional semantic spaces from lexical co-occurrence. Behavioral Research Methods, Instrumentation, and Computers, 28, 203-208.
Mandera, P., Keuleers, E., & Brysbaert, M. (2017). Explaining human performance in psycholinguistic tasks with models of semantic similarity based on prediction and counting: A review and empirical validation. Journal of Memory and Language, 92, 57-78.
Martin, A. (2007). The representation of object concepts in the brain. Annual Review of Psychology, 58, 25-45.
McMurray, B., Apfelbaum, K. S., & Tomblin, J. B. (2022). The slow development of real-time processing: Spoken-word recognition as a crucible for new thinking about language acquisition and language disorders. Current Directions in Psychological Science, 31(4), 305-315.
McNamara, T. (2005). Semantic priming. New York, NY: Psychology Press.
McRae, K., Cree, G. S., Seidenberg, M. S., & McNorgan, C. (2005). Semantic feature production norms for a large set of living and nonliving things. Behavior Research Methods, 37, 547-559.
McRae, K., Khalkhali, S., & Hare, M. (2012). Semantic and associative relations: Examining a tenuous dichotomy. In V. F. Reyna, S. B. Chapman, M. R. Dougherty, & J. Confrey (Eds.), The adolescent brain: Learning, reasoning, and decision making (pp. 39-66). Washington, DC: APA.
McRae, K., Spivey-Knowlton, M. J., & Tanenhaus, M. K. (1998). Modeling the influence of thematic fit (and other constraints) in on-line sentence comprehension. Journal of Memory and Language, 38, 283-312.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013a). Efficient estimation of word representations in vector space. arXiv:1301.3781.
Mikholov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013b). Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, 28 (2013), Lake Tahoe, NV.
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39-41.
Mirman, D., & Magnuson, J. S. (2008). Attractor dynamics and semantic neighborhood density: Processing is slowed by near neighbors and speeded by distant neighbors. Journal of Experimental Psychology: Learning, Memory, and Cognition, 34, 65-79.
Mohajer, M., Englmeier, K.-H., & Schmid, V. J. (2010). A comparison of Gap statistic with and with-out logarithm function. Technical Report No. 096, Department of Statistics, University of Munich.
Mur, M., Meys, M., Bodurka, J., Goebel, R., Bandettini, P. A., & Kriegeskorte, N. (2013). Human object-similarity judgements reflect and transcend the primate-IT object representation. Frontiers in Psychology, 4, 128.
Musz, E., Yee, E., & Thompson-Schill, S. L. (2012). Mapping the similarity space of concepts in sensorimotor cortex. Poster presented at the 2012 Meeting of the Cognitive Neuroscience Society, Chicago, IL:.
Nelson, D. L., McEvoy, C. L., & Schreiber, T. A. (2004). The University of South Florida free association, rhyme, and word fragment norms. Behavior Research Methods, Instruments, & Computers, 36, 402-407.
Nenadić, F., Podlubny, R. G., Schmidtke, D., Kelley, M. C., & Tucker, B. V. (2022). Semantic richness effects in isolated spoken word recognition: Evidence from massive auditory lexical decision. Journal of Experimental Psychology: Learning, Memory, and Cognition. Advance online publication. https://doi.org/10.1037/xlm0001208.
Nitsan, G., Banai, K., & Ben-David, B. M. (2022). One size does not fit all: Examining the effects of working memory capacity on spoken word recognition in older adults using eye tracking. Frontiers in Psychology, 13, 841466.
Papies, E. K. (2013). Tempting food words activate eating simulations. Frontiers in Psychology, 4, 838.
Pennington, J., Socher, R., & Manning, C. D. (2014). Glove: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar (pp. 1532-1543).
Pexman, P. M., Hargreaves, I. S., Siakaluk, P. D., Bodner, G. E., & Pope, J. (2008). There are many ways to be rich: Effects of three measures of semantic richness on visual word recognition. Psychonomic Bulletin & Review, 15, 161-167.
Pexman, P. M., Heard, A., Lloyd, E., & Yap, M. J. (2017). The Calgary semantic decision project: Concrete/abstract decision data for 10,000 English words. Behavior Research Methods, 49, 407-417.
Poeppel, D., & Idsardi, W. (2022). We don't know how the brain stores anything, let alone words. Trends in Cognitive Sciences, 26(12), 1054-1055. https://doi.org/10.1016/j.tics.2022.08.010.
Rabovsky, M., & McRae, K. (2014). Simulating the N400 ERP component as semantic network error: Insights from a feature-based connectionist attractor model of word meaning. Cognition, 132, 68-89.
Rabs, E., Delogu, F., Drenhaus, H., & Crocker, M. W. (2022). Situational expectancy or association? The influcence of event knowledge on the N400. Language, Cognition, and Neuroscience, 37, 766-784.
Rapp, R. (2002). The computation of word associations: Comparing syntagmatic and paradigmatic approaches. Proceedings of the 19th International Conference on Computational Linguistics, New Delhi, India (pp. 1-7).
Rehurek, R., & Sojka, P. (2011). Gensim-python framework for vector space modelling. NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic, 3(2).
Rogers, T. T., Lambon Ralph, M. A., Garrard, P., Bozeat, S., McClelland, J. L., Hodges, J. R., & Patterson, K. (2004). The structure and deterioration of semantic memory: A neuropsychological and computational investigation. Psychological Review, 111, 205-235.
Rotaru, A. S., Vigliocco, G., & Frank, S. L. (2018). Modeling the structure and dynamics of semantic processing. Cognitive Science, 42, 2890-2917.
Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274(5294), 1926-1928.
Sahlgren, M. (2006). The word-space model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high dimensional vector spaces. [PhD dissertation], University of Stockholm.
Santus, E., Chersoni, E., Lenci, A., & Blache, P. (2017). Measuring thematic fit with distributional feature overlap. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark (pp. 659-669).
Sayeed, A., Greenberg, C., & Demberg, V. (2016). Thematic fit evaluation: An aspect of selectional preferences. Proceedings of ACL Workshop for Evaluating Vector Space Representations for NLP, Berlin, Germany.
Sokal, R., & Michener, C. (1958). A statistical method for evaluating systematic relationships. University of Kansas Science Bulletin, 38, 1409-1438.
Stella, M., Beckage, N. M., & Brede, M. (2017). Multiplex lexical networks reveal patterns in early word acquisition in children. Scientific Reports, 7, 1-10.
Taylor, W. L. (1953). Cloze procedure: A new tool for measuring readability. Journalism Bulletin, 30(4), 415-433.
Tenney, I., Xia, P., Chen, B., Wang, A., Poliak, A., McCoy, R. T., Kim, N., Van Durme, B., Bowman, S. R., Das, D., & Pavlick, E. (2019). What do you learn from context? Probing for sentence structure in contextualized word representations. International Conference on Learning Representations 2019, New Orleans, LA.
Tilk, O., Demberg, V., Sayeed, A., Klakow, D., & Thater, S. (2016). Event participant modelling with neural networks. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austic, TX (pp. 171-182).
Troyer, M., & Kutas, M. (2020). To catch a Snitch: Brain potentials reveal variability in the functional organization of (fictional) world knowledge during reading. Journal of Memory & Language, 113, 104-111.
Utsumi, A. (2015). A complex network approach to Distributional Semantic Models. PLoS ONE, 10(8), e0136277. https://doi.org/10.1371/journal.pone.0136277.
Vasmani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. National Information Processing Systems, NIPS 2017, Long Beach, CA.
Vinson, D. P., & Vigliocco, G. (2008). Semantic feature production norms for a large set of objects and events. Behavior Research Methods, 40, 183-190.
Wang, F., Kaneshiro, B., Strauber, C.B., Hasak, L., Nguyen, Q. T. H., Yakovleva, A., Vildavski, V. Y., Norcia, A. M., & McCandliss, B. D. (2021). Distinct neural sources underlying visual word form processing as revealed by steady state visual evoked potentials (SSVEP). Scientific Reports, 11, 18229.
Wingfield, C., & Connell, L. (2022). Understanding the role of linguistic distributional knowledge in cognition. Language, Cognition and Neuroscience, 37(10), 1220-1270.
Yee, E., Ahmed, S. Z., & Thompson-Schill, S. L. (2012). Colorless green ideas (can) prime furiously. Psychological Science, 23(4), 364-369.
Yee, E., Drucker, D. M., & Thompson-Schill, S. L. (2010). fMRI-adaptation evidence of overlapping neural representations for objects related in function or manipulation. NeuroImage, 50, 753-763.
Yee, E., Huffstetler, S., & Thompson-Schill, S. L. (2011). Function follows form: Activation of shape and function features during object identification. Journal of Experimental Psychology: General, 140, 348-363.
Zacks, J. M. (2020). Event perception and memory. Annual Review of Psychology, 71, 165-191.
Weitere Informationen
Distributional semantic models (DSMs) are a primary method for distilling semantic information from corpora. However, a key question remains: What types of semantic relations among words do DSMs detect? Prior work typically has addressed this question using limited human data that are restricted to semantic similarity and/or general semantic relatedness. We tested eight DSMs that are popular in current cognitive and psycholinguistic research (positive pointwise mutual information; global vectors; and three variations each of Skip-gram and continuous bag of words (CBOW) using word, context, and mean embeddings) on a theoretically motivated, rich set of semantic relations involving words from multiple syntactic classes and spanning the abstract-concrete continuum (19 sets of ratings). We found that, overall, the DSMs are best at capturing overall semantic similarity and also can capture verb-noun thematic role relations and noun-noun event-based relations that play important roles in sentence comprehension. Interestingly, Skip-gram and CBOW performed the best in terms of capturing similarity, whereas GloVe dominated the thematic role and event-based relations. We discuss the theoretical and practical implications of our results, make recommendations for users of these models, and demonstrate significant differences in model performance on event-based relations.
(© 2023 Cognitive Science Society LLC.)