Type–token distinction

From Infogalactic: the planetary knowledge core
Jump to: navigation, search
Although this flock is made of the same type of bird, each individual bird is a different token. (50 MB video of a flock of birds in Rome)

In disciplines such as logic, linguistics, metalogic, typography, and computer programming, the type–token distinction is a distinction that separates a descriptive concept from objects that instantiate the concept, seen as particular instances of it. For example, the sentence "the bicycle is in the garage" refers to a token of the type named "bicycle", while the sentence "the bicycle is becoming more popular" refers to the type.

This distinction in computer programming between classes and objects is similar, though in this context, "class" may refer to a set of objects (with class-level attribute or operations) rather than a description of an object in the set.

The words type, concept, property, quality, feature and attribute are all used in describing things. Some verbs fit some of these words better than others. E.g. You might say a rose bush is a plant that instantiates the type(s), or embodies the concept(s), or exhibits the properties, or possesses the qualities, features or attributes “thorny”, “flowering” and “bushy”. The term "property" is used ambiguously to mean property type (height in feet) and/or property instance (1.74). The term "concept" is probably used more often for the property type (height in feet) than the property instance.

Types like "thorny" are often understood ontologically as concepts. Types exist in descriptions of objects, but not as tangible physical objects. A type may have many tokens. However, types are not directly producible as tokens are. One can show someone a particular bicycle, but cannot show someone the type "bicycle", as in "the bicycle is popular." It is often presumed that tokens exist in space and time as concrete physical objects. But tokens of the types "thought", "tennis match", "government" and "act of kindness" don't fit this presumption.

Clarity requires us to distinguish between abstract "types" and the "tokens" or things that embody or exemplify types. If we hear that two people "have the same car", we may conclude that they have the same type of car (e.g. the same make and model), or the same particular token of the car (e.g. they share a single vehicle). The distinction is useful in other ways, during discussion of language.


There is a related distinction very closely connected with the type-token distinction. This distinction is the distinction between an object, or type of object, and an occurrence of it. In this sense, an occurrence is not necessarily a token. Considering the sentence: "A rose is a rose is a rose". We may equally correctly state that there are eight or three words in the sentence. There are, in fact, three word types in the sentence: "rose", "is" and "a". There are eight word tokens in a token copy of the line. The line itself is a type. There are not eight word types in the line. It contains (as stated) only the three word types, 'a,' 'is' and 'rose,' each of which is unique. So what do we call what there are eight of? They are occurrences of words. There are three occurrences of the word type 'a,' two of 'is' and three of 'rose'.

The need to distinguish tokens of types from occurrences of types arises, not just in linguistics, but whenever types of things have other types of things occurring in them.[1] Reflection on the simple case of occurrences of numerals is often helpful.


In typography, the type–token distinction is used to determine the presence of a text printed by movable type:[2]

The defining criteria which a typographic print has to fulfill is that of the type identity of the various letter forms which make up the printed text. In other words: each letter form which appears in the text has to be shown as a particular instance ("token") of one and the same type which contains a reverse image of the printed letter.

Charles Sanders Peirce's type–token distinction

There are only 26 letters in the English alphabet and yet there are more than 26 letters in this sentence. Moreover, every time a child writes the alphabet 26 new letters have been created.

The word 'letters' was used three times in the above paragraph, each time in a different meaning. The word 'letters' is one of many words having "type–token ambiguity". This section disambiguates 'letters' by separating the three senses using terminology standard in logic today. The key distinctions were first made by the American logician-philosopher Charles Sanders Peirce in 1906 using terminology that he established.[3]

The letters that are created by writing are physical objects that can be destroyed by various means: these are letter TOKENS or letter INSCRIPTIONS. The 26 letters of the alphabet are letter TYPES or letter FORMS.

Peirce's type–token distinction, also applies to words, sentences, paragraphs, and so on: to anything in a universe of discourse of character-string theory, or concatenation theory.[lower-alpha 1] There is only one word type spelled el-ee-tee-tee-ee-ar,[4] namely, 'letter'; but every time that word type is written, a new word token has been created.

Some logicians consider a word type to be the class of its tokens. Other logicians counter that the word type has a permanence and constancy not found in the class of its tokens. The type remains the same while the class of its tokens is continually gaining new members and losing old members.

The word type 'letter' uses only four letter types: el, ee, tee, and ar. Nevertheless, it uses ee twice and tee twice. In standard terminology, the word type 'letter' has six letter OCCURRENCES and the letter type ee OCCURS twice in the word type 'letter'. Whenever a word type is inscribed, the number of letter tokens created equals the number of letter occurrences in the word type.

Peirce's original words are the following. "A common mode of estimating the amount of matter in a … printed book is to count the number of words. There will ordinarily be about twenty 'thes' on a page, and, of course, they count as twenty words. In another sense of the word 'word,' however, there is but one word 'the' in the English language; and it is impossible that this word should lie visibly on a page, or be heard in any voice …. Such a … Form, I propose to term a Type. A Single … Object … such as this or that word on a single line of a single page of a single copy of a book, I will venture to call a Token. …. In order that a Type may be used, it has to be embodied in a Token which shall be a sign of the Type, and thereby of the object the Type signifies." – Peirce 1906, Ogden-Richards, 1923, 280-1.

These distinctions are subtle but solid and easy to master. Reflection on the simple case of occurrences of numerals is often helpful.[lower-alpha 2] This section ends using the new terminology to disambiguate the first paragraph.

There are 26 letter types in the English alphabet and yet there are more than 26 letter occurrences in this sentence type. Moreover, every time a child writes the alphabet 26 new letter tokens have been created.

See also


  1. Stanford Encyclopedia of Philosophy, Types and Tokens
  2. Brekle, Herbert E.: Die Prüfeninger Weiheinschrift von 1119. Eine paläographisch-typographische Untersuchung, Scriptorium Verlag für Kultur und Wissenschaft, Regensburg 2005, ISBN 3-937527-06-0, p. 23
  3. Charles Sanders Peirce, Prolegomena to an apology for pragmaticism, Monist, vol.16 (1906), pp. 492–546.
  4. Using a variant of Alfred Tarski's structural-descriptive naming found in John Corcoran , Schemata: the Concept of Schema in the History of Logic, Bulletin of Symbolic Logic, vol. 12 (2006), pp. 219–40.


  • Baggin J., and Fosl, P. (2003) The Philosopher's Toolkit. Blackwell: 171-73. ISBN 978-0-631-22874-5.
  • Peper F., Lee J., Adachi S.,Isokawa T. (2004) Token-Based Computing on Nanometer Scales, Proceeding of the ToBaCo 2004 Workshop on Token Based Computing, Vol.1 pp. 1–18.

External links