Symbol grounding problem
A major contributor to this article appears to have a close connection with its subject. (September 2014)
This article may be unbalanced towards certain viewpoints. (December 2010)
The symbol grounding problem is related to the problem of how words (symbols) get their meanings, and hence to the problem of what meaning itself really is. The problem of meaning is in turn related to the problem of consciousness, or how it is that mental states are meaningful. According to a widely held theory of cognition called "computationalism," cognition (i.e., thinking) is just a form of computation. But computation in turn is just formal symbol manipulation: symbols are manipulated according to rules that are based on the symbols' shapes, not their meanings. How are those symbols (e.g., the words in our heads) connected to the things they refer to? It cannot be through the mediation of an external interpreter's head, because that would lead to an infinite regress, just as looking up the meanings of words in a (unilingual) dictionary of a language that one does not understand would lead to an infinite regress. The symbols in an autonomous hybrid symbolic+sensorimotor system—a Turing-scale robot consisting of both a symbol system and a sensorimotor system that reliably connects its internal symbols to the external objects they refer to, so it can interact with them Turing-indistinguishably from the way a person does—would be grounded. But whether its symbols would have meaning rather than just grounding is something that even the robotic Turing test—hence cognitive science itself—cannot determine, or explain.
- 1 Words and meanings
- 2 Consciousness
- 3 Symbol grounding and computation
- 4 Formulation of symbol grounding problem
- 5 Requirements for symbol grounding
- 6 Symbol grounding and Brentano's notion of intentionality
- 7 See also
- 8 Notes
- 9 References
Words and meanings
We know since Frege that the thing that a word refers to (i.e., its referent) is not the same as its meaning. This is most clearly illustrated using the proper names of concrete individuals, but it is also true of names of kinds of things and of abstract properties: (1) "Tony Blair," (2) "the prime minister of the UK during the year 2004," and (3) "Cherie Blair's husband" all have the same referent, but not the same meaning.
Some have suggested that the meaning of a (referring) word is the rule or features that one must use in order to successfully pick out its referent. In that respect, (2) and (3) come closer to wearing their meanings on their sleeves, because they are explicitly stating a rule for picking out their referents: "Find whoever was prime minister of the UK during the year 2004", or whoever is Cherie's current husband". But that does not settle the matter, because there's still the problem of the meaning of the components of that rule ("UK," "during," "current," "PM," "Cherie," "husband"), and how to pick them out.
Perhaps "Tony Blair" (or better still, just "Tony") does not have this recursive component problem, because it points straight to its referent, but how? If the meaning is the rule for picking out the referent, what is that rule, when we come down to non-decomposable components like proper names of individuals (or names of kinds, as in "an unmarried man" is a "bachelor")?
It is probably unreasonable to expect us to know the rule for picking out the intended referents of our words—to know it explicitly, at least. Our brains do need to have the "know-how" to execute the rule, whatever it happens to be: they need to be able to actually pick out the intended referents of our words, such as "Tony Blair" or "bachelor." But we do not need to know consciously how our brains do that; we needn't know the rule. We can leave it to cognitive science and neuroscience to find out how our brains do it, and then explain the rule to us explicitly.
So if we take a word's meaning to be the means of picking out its referent, then meanings are in our brains. That is meaning in the narrow sense. If we use "meaning" in a wider sense, then we may want to say that meanings include both the referents themselves and the means of picking them out. So if a word (say, "Tony-Blair") is located inside an entity (e.g., oneself) that can use the word and pick out its referent, then the word's wide meaning consists of both the means that that entity uses to pick out its referent, and the referent itself: a wide causal nexus between (1) a head, (2) a word inside it, (3) an object outside it, and (4) whatever "processing" is required in order to successfully connect the inner word to the outer object.
But what if the "entity" in which a word is located is not a head but a piece of paper (or a computer screen)? What is its meaning then? Surely all the (referring) words on this screen, for example, have meanings, just as they have referents.
In 19th century, the semiotician Charles Saunders Peirce suggested what some think is a similar model: according to his triadic sign model, meaning requires (1) an interpreter, (2) a sign or representamen, (3) an object, and is (4) the virtual product of an endless regress and progress called Semiosis. Some have interpreted Peirce as addressing the problem of grounding, feelings, and intentionality for the understanding of semiotic processes. In recent years, Peirce's theory of signs is rediscovered by an increasing number of artificial intelligence researchers in the context of symbol grounding problem.
Here is where the problem of consciousness rears its head. For there would be no connection at all between scratches on paper and any intended referents if there were no minds mediating those intentions, via their own internal means of picking out those intended referents.
So the meaning of a word on a page is "ungrounded." Nor would looking it up in a dictionary help: If one tried to look up the meaning of a word one did not understand in a (unilingual) dictionary of a language one did not already understand, one would just cycle endlessly from one meaningless definition to another. One's search for meaning would be ungrounded. In contrast, the meaning of the words in one's head—those words one does understand—are "grounded" (by a means that cognitive neuroscience might eventually reveal to us). And that grounding of the meanings of the words in one's head mediates between the words on any external page one reads (and understands) and the external objects to which those words refer. 
Symbol grounding and computation
What about the meaning of a word inside a computer? Is it like the word on the page or like the word in one's head? This is where the Symbol Grounding Problem comes in. Is a dynamic process transpiring in a computer more like the static paper page, or more like another dynamical system, the brain?
There is a school of thought according to which the computer is more like the brain—or rather, the brain is more like the computer: According to this view (called "computationalism", a variety of functionalism), the future theory explaining how the brain picks out its referents (the theory that cognitive neuroscience may eventually arrive at) will be a purely computational one (Pylyshyn 1984). A computational theory is a theory at the software level. It is essentially a computer algorithm: a set of rules for manipulating symbols. And the algorithm is "implementation-independent." That means that whatever it is that an algorithm is doing, it will do the same thing no matter what hardware it is executed on. The physical details of the dynamical system implementing the computation are irrelevant to the computation itself, which is purely formal; any hardware that can run the computation will do, and all physical implementations of that particular computer algorithm are equivalent, computationally.
A computer can execute any computation. Hence once computationalism finds a proper computer algorithm, one that our brain could be running when there is meaning transpiring in our heads, meaning will be transpiring in that computer too, when it implements that algorithm.
How would we know that we have a proper computer algorithm? It would have to be able to pass the Turing test (TT). That means it would have to be capable of corresponding with any human being as a pen-pal, for a lifetime, without ever being in any way distinguishable from a real human pen-pal.
Searle's chinese room argument
Searle formulated his famous "Chinese room Argument," in order to disprove computationalism. In it, he pointed out that if the Turing test were conducted in Chinese, then he himself, Searle (who does not understand Chinese), could execute a program that implements the same algorithm that the computer was using without knowing what any of the words he was manipulating meant. So if there's no meaning going on inside Searle's head when he is implementing that program, then there's no meaning going on inside the computer when it is the one implementing the algorithm either, computation being implementation-independent.
How does Searle know that there is no meaning going on in his head when he is executing such a TT-passing program? Exactly the same way he knows whether there is or is not meaning going on inside his head under any other conditions: He understands the words of English, whereas the Chinese symbols that he is manipulating according to the algorithm's rules mean nothing whatsoever to him (and there is no one else in his head for them to mean anything to). The symbols that are coming in, being rulefully manipulated, and then being sent out by any implementation of the TT-passing computer algorithm, whether Searle or a computer, are like the ungrounded words on a page, not the grounded words in a head.
Note that in pointing out that the Chinese words would be meaningless to him under those conditions, Searle has appealed to consciousness. Otherwise one could argue that there would be meaning going on in Searle's head under those conditions, but that Searle himself would simply not be conscious of it. That is called the "Systems Reply" to Searle's Chinese Room Argument, and Searle rejects the Systems Reply as being merely a reiteration, in the face of negative evidence, of the very thesis (computationalism) that is on trial in his thought-experiment: "Are words in a running computation like the ungrounded words on a page, meaningless without the mediation of brains, or are they like the grounded words in brains?"
In this either/or question, the (still undefined) word "ungrounded" has implicitly relied on the difference between inert words on a page and consciously meaningful words in our heads. And Searle is asserting that under these conditions (the Chinese TT), the words in his head would not be consciously meaningful, hence they would still be as ungrounded as the inert words on a page.
So if Searle is right, that (1) both the words on a page and those in any running computer program (including a TT-passing computer program) are meaningless in and of themselves, and hence that (2) whatever it is that the brain is doing to generate meaning can't be just implementation-independent computation, then what is the brain doing to generate meaning (Harnad 2001a)?
Formulation of symbol grounding problem
To answer this question we have to formulate the symbol grounding problem itself (Harnad 1990):
First we have to define "symbol": A symbol is any object that is part of a symbol system. (The notion of single symbol in isolation is not a useful one.) Symbols are arbitrary in their shape. A symbol system is a set of symbols and syntactic rules for manipulating them on the basis of their shapes (not their meanings). The symbols are systematically interpretable as having meanings and referents, but their shape is arbitrary in relation to their meanings and the shape of their referents.
A numeral is as good an example as any: Numerals (e.g., "1," "2," "3,") are part of a symbol system (arithmetic) consisting of shape-based rules for combining the symbols into ruleful strings. "2" means what we mean by "two", but its shape in no way resembles, nor is it connected to, "two-ness." Yet the symbol system is systematically interpretable as making true statements about numbers (e.g. "1 + 1 = 2").
It is critical to understand the property that the symbol-manipulation rules are based on shape rather than meaning (the symbols are treated as primitive and undefined, insofar as the rules are concerned), yet the symbols and their ruleful combinations are all meaningfully interpretable. It should be evident in the case of formal arithmetic, that although the symbols make sense, that sense is in our heads and not in the symbol system. The numerals in a running desk calculator are as meaningless as the numerals on a page of hand-calculations. Only in our minds do they take on meaning (Harnad 1994).
This is not to deprecate the property of systematic interpretability: We select and design formal symbol systems (algorithms) precisely because we want to know and use their systematic properties; the systematic correspondence between scratches on paper and quantities in the universe is a remarkable and extremely powerful property. But it is not the same thing as meaning, which is a property of certain things going on in our heads.
Requirements for symbol grounding
Another symbol system is natural language (Fodor 1975). On paper or in a computer, language, too, is just a formal symbol system, manipulable by rules based on the arbitrary shapes of words. But in the brain, meaningless strings of squiggles become meaningful thoughts. Harnad has suggested two properties that might be required to make this difference.
Capacity to pick out referents
One property that the symbols on static paper or even in a dynamic computer lack that symbols in a brain possess is the capacity to pick out their referents. This is what we were discussing earlier, and it is what the hitherto undefined term "grounding" refers to. A symbol system alone, whether static or dynamic, cannot have this capacity (any more than a book can), because picking out referents is not just a computational (implementation-independent) property; it is a dynamical (implementation-dependent) property.
To be grounded, the symbol system would have to be augmented with nonsymbolic, sensorimotor capacities—the capacity to interact autonomously with that world of objects, events, actions, properties and states that its symbols are systematically interpretable (by us) as referring to. It would have to be able to pick out the referents of its symbols, and its sensorimotor interactions with the world would have to fit coherently with the symbols' interpretations.
The symbols, in other words, need to be connected directly to (i.e., grounded in) their referents; the connection must not be dependent only on the connections made by the brains of external interpreters like us. Just the symbol system alone, without this capacity for direct grounding, is not a viable candidate for being whatever it is that is really going on in our brains when we think meaningful thoughts (Cangelosi & Harnad 2001).
Meaning as the ability to recognize instances (of objects) or perform actions is specifically treated in the paradigm called "Procedural Semantics", described in a number of papers including "Procedural Semantics" by Philip N. Johnson-Laird (Cognition, 5 (1977) 189; see http://www.nyu.edu/gsas/dept/philo/courses/mindsandmachines/Papers/procedural.pdf) and expanded by William A. Woods in "Meaning and Links" (AI Magazine Volume 28 Number 4 (2007); see http://www.aaai.org/ojs/index.php/aimagazine/article/view/2069/2056). A brief summary in Woods' paper reads: "The idea of procedural semantics is that the semantics of natural language sentences can be characterized in a formalism whose meanings are defined by abstract procedures that a computer (or a person) can either execute or reason about. In this theory the meaning of a noun is a procedure for recognizing or generating instances, the meaning of a proposition is a procedure for determining if it is true or false, and the meaning of an action is the ability to do the action or to tell if it has been done."
The necessity of groundedness, in other words, takes us from the level of the pen-pal Turing test, which is purely symbolic (computational), to the robotic Turing test, which is hybrid symbolic/sensorimotor (Harnad 2000, 2007). Meaning is grounded in the robotic capacity to detect, categorize, identify, and act upon the things that words and sentences refer to (see entries for Affordance and for Categorical perception).
To categorize is to do the right thing with the right kind of thing. The categorizer must be able to detect the sensorimotor features of the members of the category that reliably distinguish them from the nonmembers. These feature-detectors must either be inborn or learned. The learning can be based on trial and error induction, guided by feedback from the consequences of correct and incorrect categorization; or, in our own linguistic species, the learning can also be based on verbal descriptions or definitions. The description or definition of a new category, however, can only convey the category and ground its name if the words in the definition are themselves already grounded category names (Blondin-Massé et al. 2008). So ultimately grounding has to be sensorimotor, to avoid infinite regress (Harnad 2005).
But if groundedness is a necessary condition for meaning, is it a sufficient one? Not necessarily, for it is possible that even a robot that could pass the Turing test, "living" amongst the rest of us indistinguishably for a lifetime, would fail to have in its head what Searle has in his: It could be a Zombie, with no one home, feeling feelings, meaning meanings (Harnad 1995).
Harnad thus points at consciousness as a second property. The problem of discovering the causal mechanism for successfully picking out the referent of a category name can in principle be solved by cognitive science. But the problem of explaining how consciousness can play an independent role in doing so is probably insoluble, except on pain of telekinetic dualism. Perhaps symbol grounding (i.e., robotic TT capacity) is enough to ensure that conscious meaning is present, but then again, perhaps not. In either case, there is no way we can hope to be any the wiser—and that is Turing's methodological point (Harnad 2001b, 2003, 2006).
Symbol grounding and Brentano's notion of intentionality
"Intentionality" has been called the "mark of the mental" because of some observations by the philosopher Brentano to the effect that mental states always have an inherent, intended (mental) object or content toward which they are "directed": One sees something, wants something, believes something, desires something, understands something, means something etc., and that object is always something that one has in mind. Having a mental object is part of having anything in mind. Hence it is the mark of the mental. There are no "free-floating" mental states that do not also have a mental object. Even hallucinations and imaginings have an object, and even feeling depressed feels like something. Nor is the object the "external" physical object, when there is one. One may see a real chair, but the "intentional" object of one's "intentional state" is the mental chair one has in mind. (Yet another term for intentionality has been "aboutness" or "representationality": thoughts are always about something; they are (mental) "representations" of something; but that something is what it is that the thinker has in mind, not whatever external object may or may not correspond to it.)
If this all sounds like skating over the surface of a problem rather than a real break-through, then the foregoing description has had its intended effect: No, the problem of intentionality is not the symbol grounding problem; nor is grounding symbols the solution to the problem of intentionality. The symbols inside an autonomous dynamical symbol system that is able to pass the robotic Turing test are grounded, in that, unlike in the case of an ungrounded symbol system, they do not depend on the mediation of the mind of an external interpreter to connect them to the external objects that they are interpretable (by the interpreter) as being "about"; the connection is autonomous, direct, and unmediated. But grounding is not meaning. Grounding is an input/output performance function. Grounding connects the sensory inputs from external objects to internal symbols and states occurring within an autonomous sensorimotor system, guiding the system's resulting processing and output.
Meaning, in contrast, is something mental. But to try to put a halt to the name-game of proliferating nonexplanatory synonyms for the mind/body problem without solving it (or, worse, implying that there is more than one mind/body problem), let us cite just one more thing that requires no further explication: feeling. The only thing that distinguishes an internal state that merely has grounding from one that has meaning is that it feels like something to be in the meaning state, whereas it does not feel like anything to be in the merely grounded functional state. Grounding is a functional matter; feeling is a felt matter. And that is the real source of Brentano's vexed peekaboo relation between "intentionality" and its internal "intentional object": All mental states, in addition to being the functional states of an autonomous dynamical system, are also feeling states: Feelings are not merely "functed," as all other physical states are; feelings are also felt.
Hence feeling is the real mark of the mental. But the symbol grounding problem is not the same as the mind/body problem, let alone a solution to it. The mind/body problem is actually the feeling/function problem: Symbol-grounding touches only its functional component. This does not detract from the importance of the symbol grounding problem, but just reflects that it is a keystone piece to the bigger puzzle called the mind.
The neuroscientist Antonio Damasio investigates this marking function of feelings and emotions in his Somatic marker hypothesis. Damasio adds the notion of biologic homeostasis to this discussion, presenting it as an automated bodily regulation process providing intentionality to a mind via emotions. Homeostasis is the mechanism that keeps all bodily processes in healthy balance. All of our actions and perceptions will be automatically "evaluated" by our body hardware according to their contribution to homeostasis. This gives us an implicit orientation on how to survive. Such bodily or somatic evaluations can come to our mind in the form of conscious and non-conscious feelings ("gut feelings") and lead our decision-making process. The meaning of a word can be roughly conceptualized as the sum of its associations and their expected contribution to homeostasis, where associations are reconstructions of sensomotor perceptions that appeared in contiguity with the word. Yet, the Somatic marker hypothesis is still hotly debated and critics claim that it has failed to clearly demonstrate how these processes interact at a psychological and evolutionary level.
- It should be noted that although this article draws in places upon Frege's view of semantics, it is very anti-Fregean in stance. Frege was a fierce critic of psychological accounts that attempt to explain meaning in terms of mental states.
- Peirce, Charles S. The philosophy of Peirce: selected writings. New York: AMS Press, 1978.
- Semeiosis and Intentionality T. L. Short Transactions of the Charles S. Peirce Society Vol. 17, No. 3 (Summer, 1981), pp. 197-223
- C.S. Peirce and artificial intelligence: historical heritage and (new) theoretical stakes; Pierre Steiner; SAPERE - Special Issue on Philosophy and Theory of AI 5:265-276 (2013)
- Cf. anti-psychologism, psychologism, mentalism, intuitionism, constructivism, anti-realism, realism
- This mental process may learn a lesson from the information retrieval process without degenerating into radical cognitivism and computationalism as per the reductionist tradition.
- Or, "imputed" as read below the dotted baseline of the triangle of reference since 1923.
- This is exactly the causal, contextual theory of reference that Ogden & Richards packed in The Meaning of Meaning (1923).
- Cf. semantic externalism as claimed in "The Meaning of 'Meaning'" of Mind, Language and Reality (1975) by Putnam who argues: "Meanings just ain't in the head." Now he and Dummett seem to favor anti-realism in favor of intuitionism, psychologism, constructivism and contextualism.
- Note: This article is based on an entry originally published in Nature/Macmillan Encyclopedia of Cognitive Science that has since been revised by the author and the Wikipedia community.
- Blondin Masse, A, G. Chicoisne, Y. Gargouri, S. Harnad, O. Picard, O. Marcotte (2008) How Is Meaning Grounded in Dictionary Definitions? TextGraphs-3 Workshop, 22nd International Conference on Computational Linguistics, Coling 2008, Manchester, 18–22 August 2008
- Cangelosi, A. & Harnad, S. (2001) The Adaptive Advantage of Symbolic Theft Over Sensorimotor Toil: Grounding Language in Perceptual Categories. Evolution of Communication 4(1) 117-142.
- Cangelosi, A.; Greco, A.; Harnad, S. From robotic toil to symbolic theft: grounding transfer from entry-level to higher-level categories. Connection Science12(2) 143-62.
- Fodor, J. A. (1975) The language of thought. New York: Thomas Y. Crowell
- Frege, G. (1952/1892). On sense and reference. In P. Geach and M. Black, Eds., Translations of the Philosophical Writings of Gottlob Frege. Oxford: Blackwell
- Harnad, S. (1990) The Symbol Grounding Problem. Physica D 42: 335-346.
- Harnad, S. (1992) There Is Only One Mind/Body Problem. Symposium on the Perception of Intentionality, XXV World Congress of Psychology, Brussels, Belgium, July 1992 International Journal of Psychology 27: 521
- Harnad, S. (1994) Computation Is Just Interpretable Symbol Manipulation: Cognition Isn't. Minds and Machines 4:379-390 (Special Issue on "What Is Computation")
- Harnad, S. (1995) Why and How We Are Not Zombies. Journal of Consciousness Studies 1: 164-167.
- Harnad, S. (2000) Minds, Machines and Turing: The Indistinguishability of Indistinguishables. Journal of Logic, Language, and Information 9(4): 425-445. (Special Issue on "Alan Turing and Artificial Intelligence")
- Harnad, S. (2001a) Minds, Machines and Searle II: What's Wrong and Right About Searle's Chinese Room Argument? In: M. Bishop & J. Preston (eds.) Essays on Searle's Chinese Room Argument. Oxford University Press.
- Harnad, S. (2001b) No Easy Way Out. The Sciences 41(2) 36-42.
- Harnad, Stevan (2001a) Explaining the Mind: Problems, Problems. The Sciences 41: 36-42.
- Harnad, Stevan (2001b) The Mind/Body Problem is the Feeling/Function Problem: Harnad on Dennett on Chalmers. Technical Report. Department of Electronics and Computer Sciences. University of Southampton.
- Harnad, S. (2003) Can a Machine Be Conscious? How?. Journal of Consciousness Studies 10(4-5): 69-75.
- Harnad, S. (2005) To Cognize is to Categorize: Cognition is categorization. in Lefebvre, C. and Cohen, H., Eds. Handbook of Categorization. Elsevier.
- Harnad, S. (2007) The Annotation Game: On Turing (1950) on Computing, Machinery and Intelligence. In: Epstein, Robert & Peters, Grace (Eds.) The Turing Test Sourcebook: Philosophical and Methodological Issues in the Quest for the Thinking Computer. Kluwer
- Harnad, S. (2006) Cohabitation: Computation at 70 Cognition at 20. In Dedrick, D., Eds. Essays in Honour of Zenon Pylyshyn.
- MacDorman, Karl F. (1999). Grounding symbols through sensorimotor integration. Journal of the Robotics Society of Japan, 17(1), 20-24. Online version
- MacDorman, Karl F. (2007). Life after the symbol system metaphor. Interaction Studies, 8(1), 143-158. Online version
- Pylyshyn, Z. W. (1984) Computation and cognition. Cambridge MA: MIT/Bradford
- Searle, John. R. (1980) Minds, brains, and programs. Behavioral and Brain Sciences 3(3): 417-457
- Taddeo, Mariarosaria & Floridi, Luciano (2005). The symbol grounding problem: A critical review of fifteen years of research. Journal of Experimental and Theoretical Artificial Intelligence, 17(4), 419-445. Online version
- Turing, A.M. (1950) Computing Machinery and Intelligence. Mind 49 433-460 [Reprinted in Minds and machines. A. Anderson (ed.), Engelwood Cliffs NJ: Prentice Hall, 1964.]