What’s functionalism anyway?

Posted on April 2, 2019 | 10 Comments

In reading up for my new project on Group Thinking, I’ve found that people attach a certain label to a view of the metaphysics of group belief and desire that I find quite attractive. That label is “functionalism”. I’ve found myself very confused about what that common label means, so what follows is where I’ve got to in sorting that out.

Now, at a really rough level, I expect anything deserving the name “functionalism” to have at least two theoretical categories: roles and realizers. For example, if you’re going to be a functionalist about the property being in pain, you’ll be committed to (i) the idea that there is a functional role associated with pain; (ii) the idea that if anything is to be in pain, then it needs to have a realizer property, i.e. to instantiate a property that plays the functional role.

That allows us a lot of flexibility on how we flesh out the details beyond this. We might have various accounts of what sort of theories of functional roles to give. We might have various accounts of what the realization relation is—and whether we need to allow for multiple realizers, imperfect realizers, etc. We might differ in whether we identify the original property of being in pain with the role, the realizer, or something else. But unless we have an account that has the two-part structure, it isn’t functionalism as I was taught it or as I teach it.

Okay, with that as the setup, let me say something about the kind of functionalism that I understand best. This starts with Lewis’s story about how to find explicit definitions of theoretical terms. We start with a theory that neologizes—that introduces a set of terms for the first time. That theory will also reuse some old vocabulary. Lewis assumed that the theory is regimented so that all the new terms are names. The old vocabulary will include predicates like “…has the property…” or “…stands in relation … to …”, if necessary, so that we can do the work of new predicates by means of new names for the relevant properties. If we start with a theory $T(t_1,...,t_n)$, where the $t_i$ are the new terms, then the following is the unique-realization sentence for $T$:

$\exists y_1\ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=y_1\wedge \ldots \wedge x_n=y_n))$

The following one-place predicate (in the free variable $y_1$) is then what we’ll mean by “the theoretical role of $t_1$”, or the “$t_1$”-role:

$\exists y_2\ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=y_1\wedge \ldots \wedge x_n=y_n))$

The explicit definition of the new terms in old vocabulary that Lewis offered was just this: each new term names the property that plays the relevant theoretical role. Using an iota for the definite description operator, for $t_1$ the definition is:

$t_1:=\iota y_1\exists y_2 \ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=y_1\wedge \ldots \wedge x_n=y_n))$

Informally, the definition says that $t_1$ is the property that plays the $t_1$-role.
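To make the construction concrete, here is a minimal sketch in Python over a finite toy domain. Everything in it is an assumption for illustration: the candidate properties in `DOMAIN`, the causal facts in `CAUSES`, the two-term theory `toy_theory`, and the helper names are all hypothetical. It does not model Lewis's own treatment of improper descriptions; it simply returns `None` when the theory is not uniquely realized.

```python
from itertools import product

def realizations(theory, domain, n):
    """Return every n-tuple drawn from `domain` that satisfies `theory`."""
    return [tup for tup in product(domain, repeat=n) if theory(*tup)]

def uniquely_realized(theory, domain, n):
    """Finite-domain analogue of the unique-realization sentence."""
    return len(realizations(theory, domain, n)) == 1

def definite_description(theory, domain, n, i=0):
    """Component i of the unique realizing tuple, or None if there is
    no unique realization (the description is then improper)."""
    found = realizations(theory, domain, n)
    return found[0][i] if len(found) == 1 else None

# Hypothetical candidate properties and causal facts (illustration only).
DOMAIN = ["tissue_damage", "c_fibres_firing", "a_fibres_firing", "wincing"]
CAUSES = {("tissue_damage", "c_fibres_firing"), ("c_fibres_firing", "wincing")}

def toy_theory(x1, x2):
    """T(x1, x2): tissue damage causes x1, and x1 causes x2."""
    return ("tissue_damage", x1) in CAUSES and (x1, x2) in CAUSES

def plays_t1_role(y1):
    """The t_1-role: some y2 makes (y1, y2) the one and only realization."""
    found = realizations(toy_theory, DOMAIN, 2)
    return len(found) == 1 and found[0][0] == y1

print(definite_description(toy_theory, DOMAIN, 2, i=0))
```

On these toy facts the theory has exactly one realizing pair, so the description for the first term picks out its first component, and `plays_t1_role` holds of that property and of nothing else in the domain.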

Now, Lewis proves several nice results about these definitions and their relation to the original theory $T$, using a certain understanding of the definite description operator. I won’t get into that here.

One last thing that will be important: the definite description on the right hand side of the definition sentences is, in general, a non-rigid designator. Since $T$ may be uniquely realized by different tuples of properties in different worlds, the definite description will in general pick out different properties at different worlds. And sometimes—with empirical investigation—we will be able to say something informative about the property that happens to be picked out at the actual world. For some name $N$ in our old vocabulary, rigidly designating a property, we may discover:

$\exists y_2 \ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=N\wedge x_2=y_2\wedge \ldots \wedge x_n=y_n))$

From this and the definition sentence, it will follow that:

$t_1=N$

So here we have a model for how the identification of new theoretical terms with old, familiar terms could go. In these circumstances we would call $N$ the realizer of the $t_1$-role at the actual world. In general, $N_w$ will be the realizer of this role at world $w$ iff the following holds at $w$: $\exists y_2 \ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=N_w\wedge x_2=y_2\wedge \ldots \wedge x_n=y_n))$
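The world-relativity of the realizer can be illustrated with a small Python sketch. The three toy "worlds", their causal facts, and the names `WORLDS`, `theory_at` and `realizer_at` are all hypothetical; the point is only that one and the same role description can pick out different properties at different worlds, and nothing at all where the theory is unrealized.

```python
from itertools import product

# Hypothetical candidate properties (illustration only).
DOMAIN = ["tissue_damage", "c_fibres_firing", "silicon_state_s", "wincing"]

# Hypothetical causal facts at three toy worlds.
WORLDS = {
    "actual": {("tissue_damage", "c_fibres_firing"),
               ("c_fibres_firing", "wincing")},
    "w_silicon": {("tissue_damage", "silicon_state_s"),
                  ("silicon_state_s", "wincing")},
    "w_empty": set(),
}

def theory_at(world):
    """T(x1, x2) evaluated at `world`: tissue damage causes x1, x1 causes x2."""
    causes = WORLDS[world]
    def theory(x1, x2):
        return ("tissue_damage", x1) in causes and (x1, x2) in causes
    return theory

def realizer_at(world, i=0):
    """Realizer of the i-th role at `world`, or None if the theory is
    not uniquely realized there."""
    theory = theory_at(world)
    hits = [tup for tup in product(DOMAIN, repeat=2) if theory(*tup)]
    return hits[0][i] if len(hits) == 1 else None

for w in WORLDS:
    print(w, realizer_at(w))
```

The function `realizer_at` behaves like the non-rigid description: its value varies with the world argument, whereas a name like $N$ corresponds to one fixed string from `DOMAIN`.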

It’s up for debate whether $t_1$ is a rigid or non-rigid designator. If it’s a rigid designator, then $t_1=N$ will be necessary if true, but the definition sentence will be contingent (presumably, an example of the contingent a priori). $t_1$ could equally be taken to be non-rigid, allowing the definition sentence to be necessarily true (as well as a priori). In that case, $t_1=N$ will be contingent (as well as a posteriori). It seems we could go either way on this, consistent with the rest of the framework.

I’ve introduced both role and realizer terminology in connection to the Lewis account of the definitions of theoretical terms. It is the model for how I understand role and realizer terminology in the context of functionalism. However, discussion of theoretical neologisms is one thing, and discussion of “functional” vocabulary is another. Lewis’s topic in “How to Define Theoretical Terms” is the former, and that gives us a particular take on the way that theory and definition sentences relate. For Lewis, the definitions are “implicitly asserted” when we put forward $T$ as a term-introducing theory—presumably we’re doing something that’s equivalent to stipulating that they are to be (a priori) true. This is not an account that can be directly applied to terms—theoretical or otherwise—that are already in common currency. It is not an account, for example, of “pain”. In the case of pain, if “definitions” are to be offered, they have to be offered as a product of analysis, not as the product of stipulation.

Let’s turn, therefore, to a context where we are working only with terms that are already common currency. And let’s suppose that we have found a theory $T$ such that, for a suitable set of target vocabulary $t_1,\ldots,t_n$, both $T(t_1,\ldots,t_n)$ and the unique realization sentence are true. The following will then be true:

$t_1:=\iota y_1\exists y_2 \ldots \exists y_n \forall x_1\ldots x_n(T(x_1,...,x_n)\leftrightarrow (x_1=y_1\wedge \ldots \wedge x_n=y_n))$

We shouldn’t call these “definition sentences” since it’s not clear in what sense if any they are “definitions”. To highlight this, note that as a limiting case, our “theory” could simply consist in saying “Red is Arnold’s favourite colour”, with Red as the target vocabulary. The unique realization sentence is then that there is a y such that for all x, x is Arnold’s favourite colour iff x=y—which is true enough. And the putative “definition sentence” would say: Red is the y such that for all x, x is Arnold’s favourite colour iff y=x. But though this is a true identity, it is quite clearly not a “definition” of the term Red, and is obviously contingent and a posteriori.

Not any old uniquely-realized theory of target old vocabulary will do, therefore. I take it that the step to an “analytic” functionalism of a Lewisian sort imposes the following constraint: we take an analytic/a priori $T(t_1,\ldots,t_n)$. Now if, in addition, the unique realization sentence for this vocabulary is analytic/a priori, then the “definition sentences” will be analytic/a priori. Even if the unique realization sentence is not analytic/a priori, the conditional whose antecedent is the unique realization sentence and whose consequent is a definition sentence will be analytic/a priori. So we could plausibly claim the definition sentences as “an analysis” of the relevant target vocabulary, perhaps an analysis modulo the assumption of unique realization. The conjecture, for the special case of analytic functionalism about pain, etc., will be that we could pull off this trick by letting $T$ be a systematization of a set of a priori “platitudes” that uniquely characterize the typical causal role of the property of being in pain: causing distinctive kinds of behaviour, being caused by distinctive kinds of stimuli, and interacting with other (targeted) mental states in typical kinds of ways.

The assumption that we can find an (a priori) theory T that does the job just described is a major one. But if we can do it, then we can import all the distinctions and terminology from the theoretical terms case. We will have a one-place predicate that is a “theoretical role” for the target term “pain”—which given the nature of the T we’re envisaging we could aptly call a causal-functional role of “pain”. We would be up for discovering that the role is satisfied by a property rigidly designated by some N—say, C-fibres firing. And we could reason, in the fashion Lewis and Armstrong taught us, from the “definition sentence” for pain, plus the putative empirical fact that C-fibres play the pain role, to an identification of the property of being in pain with having one’s C-fibres fire.

So that’s the way I understand analytic functionalism. And I can understand other forms of functionalism as variations on the theme. For example, we could start with a metaphysically necessary (but not analytic or a priori) theory which necessarily uniquely characterizes a set of target vocabulary, and extract definition sentences from it, obtaining necessarily true (but not analytic or a priori) “definition sentences” that we might go on to present as counting as “metaphysical analyses”. We could take a scientific theory—a theory which uniquely characterizes a set of target terms with nomic necessity—and then extract “nomic analyses”, and so forth. In each case, the distinctive functionalist structure of role and realizer, and the relation between them, will be well understood. If functionalism is to be amended (e.g. to allow for imperfect realization, or non-unique realization) then I will want to figure out how to adjust the above theory to make the necessary changes.

It’s one thing to say that functionalisms can be represented as an instance of the how-to-define-theoretical-terms model of extracting definitions from theories. It’s quite another to say that every successful application of that model to common currency terms would be a functionalism. That further claim seems false to me.

For example, suppose we applied this kind of account to a term for which we already have an analysis ready-to-hand: the property of being a bachelor. An a priori uniquely characterizing theory says (let’s suppose): bachelorhood is the property of being male and being unmarried. So the “definition sentence” here is: bachelorhood is the y such that for all x, x is the intersection of being male and being unmarried iff y=x. What of the role and realizer properties here? The role property is being a y such that for all x, x is the intersection of being male and being unmarried iff y=x. What’s the realizer property?

Well, here’s a way of specifying a property that realizes the role in the minimal sense in which I introduced the terminology earlier: being a bachelor. Here’s another: the property that is the intersection of being unmarried and being male. But this seems dreadfully fishy. It doesn’t seem illuminating in the way typical identifications of realizers of functional roles would and should be. It might be true to say that pain realizes the pain role, and that the property of actually playing the pain role realizes the pain role. But in that paradigm case of functionalism what we are really interested in, and trust to be available, is some more illuminating characterization: e.g. that C-fibres firing plays the pain-role. And what we see from the bachelorhood case, I think, is that it’s entirely possible to apply all this analysis and yet for there to be no such illuminating identification of the realizer available at the end of the day.

To sum this up. In the paradigm cases of functionalism, we expect a two-step methodology. There’s first the step of identifying a relevant uniquely characterizing theory, from which by turning a crank we can extract “functional roles”. And then, we expect a second stage, where we or others do further non-trivial work (in the paradigm cases, empirical work) that gives us an illuminating way of identifying the realizors of those roles, using a vocabulary that differs from that used in characterizing the role itself. The realizors will be some relatively natural “kind” or natural enough property, relative to a somehow-privileged vocabulary. In the paradigm functionalisms, there’s also a suitable distance between the vocabulary used to specify the role, and the vocabulary used in the illuminating identification of the realizor.

Here’s a way of thinking about all this. There’s a genus-level notion of role and realizor here, which we find in functionalism, in understanding theoretical neologisms, and so forth. But in order to have a functionalism worthy of the name, we need more than such minimal roles and realizors—we need roles that are genuinely “functional” and which contrast sufficiently with their natural-enough “realizors”. That vague characterization is probably enough for us to get on with the hard work of finding examples that fit this bill.

But if this is the right way to think of things, then we should resist the thought that whenever we extract definitions from a theory in the Lewis style, we’re engaged in functionalist analysis. And I definitely want to resist the thought that in undertaking that first kind of project, we are committed to there being “realizers” of the theoretical roles used in those definitions in a more-than-minimal sense. Sometimes, perhaps, it will follow from the content of the characterizing theory that realizers of the roles will be more-than-minimal—e.g. perhaps that role is a causal one, and we are independently committed to thinking that only sufficiently natural properties can stand in causal relations. Perhaps part of the characterizing theory itself is the claim that the relevant property is natural enough. That might guarantee that if successful, the analysis will turn out to be a functionalist one. But this needs to be argued out on a case by case basis.

To go back to the beginning: when people talk about functionalist analyses of believing that p and desiring that q, whether in application to groups or individuals, I think that often what they’re picking out are definitions of belief and desire that are extracted from an overall theory of belief and desire in the “theoretical role” way. But it’s a huge step from that to assume that one is committed to full-blown functionalism about belief and desire, with its more-than-minimal realizors of the roles so-characterized. I think it’s misleading to label accounts that aren’t committed to more-than-minimal realizors as kinds of functionalism, and I think that’s one reason that I got myself puzzled at the way the terminology is (sometimes) used in this area.


10 responses to “What’s functionalism anyway?”

David Chalmers | April 3, 2019 at 3:44 am

for what it’s worth, when i read your initial characterization of functionalism, i thought it wasn’t quite right because (ii) [there needs to be a realizer property] isn’t essential. well, it’s essential to “realizer” functionalism, but not to all varieties of “role” functionalism. some role functionalists will hold that functional properties needn’t always be realized — e.g. powers to phi count as functional properties even if they are pure powers without realizers.

that said, i think you’re right that merely abstracting a property as what plays a theoretical role doesn’t suffice for functionalism on ordinary philosophical usage — on ordinary usage, the role needs to be a broadly causal/dispositional role. (people sometimes use “structuralist” for the broader class of views). perhaps most roles of this sort have realizers, but the pure power case brings out that even here it isn’t essential.
Robbie | April 3, 2019 at 8:42 am

Dave: interesting! I had thought of the role- vs. realizer-functionalist distinction as a disagreement over what gets to be the target property—whether pain is C-fibres firing, or whether it is possessing some-property-which-plays-the-pain-role (an existential that’s witnessed by C-fibres).

I guess on the way I’m thinking about things, there’ll always be a realizer—albeit of the minimal sort—we’re guaranteed that there *is* a property that plays the role. That does presuppose that all roles are regimented as theoretical roles. Whether there is a “substantial” realizer is of course an open question.

So if I’m getting your terminology right, if we give the theoretical-role definitions of properties, that’s a structuralist theory of them. If the theoretical role is causal/dispositional, that’s a kind of functionalism as you understand it. If we think there are substantial realizers (in some sense of “substantial” to be spelt out—e.g. non-dispositional?, natural enough?, etc) that gets us into the kinds of functionalism that I was originally thinking of. And if we identify the target properties with the substantial realizers, that’s the realizer functionalism of Armstrong/Lewis.
Robbie | April 3, 2019 at 9:02 am

I think I now see where a source of my confusion might be coming from—it’s because I’m taking something like realizor functionalism as the paradigm, and working backwards to what the broader genus might be.

If one starts with e.g. powers and ordinary causal/dispositional properties as the paradigm of “functional” properties, the structuralist move to deal with sets of interconnected powers (etc) looks like a powerful method of generalization. And the issue of whether these have (substantial) realizers is an additional theoretical question, linked to the question of whether single powers need some categorical base.

If one starts with theoretical terms as the paradigm, then from the beginning one is dealing with interconnected/structural definitions, but it’s natural to view the restriction to causal/dispositional theoretical roles as a bit of an optional restriction of scope. Having substantial realizers looks like a different way of retaining the elements of a paradigm functionalism such as Armstrong’s, while dropping the (from that point of view) somewhat arbitrary restriction to a particular type of functional role.
David Chalmers | April 3, 2019 at 3:33 pm

right — i’m thinking of dispositions and the like as the paradigm. that mirrors the way that functionalism developed in the philosophy of mind, as a successor to behaviorism, with the lewisian focus on theories and realizers coming along relatively late in the day (though of course there were elements of it earlier, e.g. in sellars). i can see that if one started with theoretical terms, one would end up with a different view of what’s at the core of functionalism.

your summary in the previous comment of my way of carving things up is more or less right, though as above i’d also leave the door open for versions of functionalism that don’t involve abstracting functional properties from theories.
Daniel Elstein | April 3, 2019 at 5:16 pm

This discussion clarifies things for me a lot. I think it would do me too much credit to say that I already thought what Dave says here, but certainly what he says is more in line with the way I was thinking about things when we talked about this. It seems to me that the core functionalist idea is that a causal/dispositional role is central to individuating a certain kind of state/property, either directly (role functionalism) or indirectly (realizer functionalism). I’m happy to go along with the idea that functionalists usually do assume that there are realizers, but I don’t fully understand the requirement of “more-than-minimal realizers”. If the idea is just that there has to be a genuine distinction between role and realizer, isn’t that just guaranteed by multiple realizability? I can see that the issue arises when we are assuming unique realizability, but in the case at hand (belief that P) it looks like the natural assumption of multiple realizability automatically gets you the genuine distinction you are after. So if “full-blown” functionalism is guaranteed by multiple realizability, then I think that functionalist analyses of beliefs and desires will almost always be “full-blown”.
diagonallemma | April 8, 2019 at 4:42 pm

It would be nice to have a label for the general methodology outlined in “How to define theoretical terms”. For example, Lewis says that “in order to say what a meaning is, we may first ask what a meaning does, and then find something that does that”. Is this “structuralism” about meaning? That doesn’t sound right. (But perhaps only because the label is already used for something else.) “Functionalism” sounds a little better.

When thinking about the theoretical-terms approach for properties, I find it useful to clarify that the relevant theoretical role expresses a second-order property — a property of properties. A realiser of the role is typically a first-order property of individuals. Since pain is a property of individuals, it’s clear that “pain” can’t denote the second-order role property; it must denote the realizer. But that realizer might be a property like /being in a state that does so-and-so/, which is itself a role property in some intuitive sense, and multiply realizable.

In principle, the second-order role property can be satisfied by different first-order properties at different worlds, but if we’re analysing ordinary (simple) concepts or words, I’m not aware of any clear example where this happens. The more common case, it seems, is that the same first-order property plays the theoretical role at every world, so that the iota terms are rigid. (Often, as in the case of bachelorhood, it seems that the first-order property can simply be read off from the second-order role.)

By the way, Lewis only needs the Carnap sentence of the relevant theory to be analytic, not the theory itself. If you strengthen a theory, the theory’s Carnap sentence becomes weaker. So the relevant theories can be highly non-analytic.
Robbie | April 21, 2019 at 12:55 pm

Hi Wo, good points! (Sorry I missed this comment for a while—I need to turn notifications back on I think). And yes—I’m not sure what terminology is best, since everything that suggests itself already has usages (sometimes conflicting usage).

On the Carnap sentence/theory thing. I was thinking that in the case of newly introduced theoretical terms, there was a direct argument for thinking that the Carnap sentence was analytic, because of the way the terms are introduced via the theory. In that case, the theory obviously didn’t need to be analytic in order to apply the method (the paradigms are empirical theories, after all).

But now consider the case of theories framed with old terms. Why think the Carnap sentence is analytic there? In general, for an arbitrary theory using old terms, it just won’t be. So I was thinking: well, since it’s sufficient for the Carnap sentence being analytic that the theory itself is analytic, we can at least apply the method whenever we’ve managed to find a uniquely-characterizing analytic theory involving some old terms. So HTDTT applied to analytic theories involving old terms is a definite method of finding explicit definitions of those old terms. Without some more guidance about which theories to look at (whose Carnap sentences have a decent chance of being analytic) then I’m not seeing how to apply HTDTT as a method of finding definitions.
- diagonallemma | April 22, 2019 at 11:12 am
  
  Hi Robbie,
  
  yes, for an arbitrary theory the Carnap sentence won’t be analytic. But if so, then that’s typically because the theory is too weak, not because it’s too strong, so making it even weaker (towards analyticity) looks like the wrong move.
  
  For example, let the chosen water theory T be the single statement that cucumbers contain water. The Carnap sentence of T says that either cucumbers contain nothing or cucumbers contain water. This isn’t analytic because one can easily imagine that cucumbers turn out to contain some stuff that isn’t water, while they don’t contain any water.
  
  Now suppose we strengthen T by adding lots of further (non-analytic) things we believe about water. It then becomes harder and harder to imagine how we could discover that cucumbers contain some stuff that plays the T role and yet don’t contain water.
  
  In general, I’d say to use HTDTT, a good start is to let the relevant theory be the totality of everything we believe about a given subject matter. The relevant Carnap sentence will almost certainly be analytic.
Robbie | April 24, 2019 at 12:01 pm

Ah! Threading replies. I remember that now.

I take your point, but here’s something that bothers me. Suppose that I have weird false beliefs about water. Maybe still, *if* something played the water-according-to-weirdo-Robbie role, it’d be water. Since nothing plays that role, the definite description we get out of HTDTT as a “definition” of water will be denotationless at the actual world. But that’s the wrong result: “water” does denote something at the actual world, despite my false beliefs about it. Something’s gone wrong.

If we had a theory that we had a guarantee to be *true* whenever the target terms denoted anything, then we wouldn’t have this problem—thus the proposal to start only with things pretty much guaranteed to be true.
- diagonallemma | April 28, 2019 at 12:10 pm
  
  Right, that’s another problem. Ideally the theory should be true whenever the target terms denote anything. But that still doesn’t mean that the theory should be analytic, which would guarantee that the theory is true no matter what. (There’s a sketch of my preferred take on all this on pp.3-4 at https://www.umsu.de/papers/functionalism.pdf )