Abstract
Computational models of managerial search often build on backward-looking search based on hill-climbing algorithms. Regardless of its prevalence, there is some evidence that this family of algorithms does not universally represent managers’ search behavior. Against this background, the paper proposes an alternative algorithm that captures key elements of Simon’s concept of satisficing which received considerable support in behavioral experiments. The paper contrasts the satisficing-based algorithm to two variants of hill-climbing search in an agent-based model of a simple decision-making organization. The model builds on the framework of NK fitness landscapes which allows controlling for the complexity of the decision problem to be solved. The results suggest that the model’s behavior may remarkably differ depending on whether satisficing or hill-climbing serves as an algorithmic representation for decision-makers’ search. Moreover, with the satisficing algorithm, results indicate oscillating aspiration levels, even to the negative, and intense—and potentially destabilizing—search activities when intra-organizational complexity increases. Findings may shed some new light on prior computational models of decision-making in organizations and point to avenues for future research.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Computational models of managerial search often comprise adaptive processes based on experiential learning and backward-looking search behavior (e.g., Gavetti and Levinthal 2000; Kollman et al. 2000; Dosi et al. 2003; Ethiraj and Levinthal 2004; Siggelkow and Rivkin 2005; Wall 2017). In computational models of managerial search, for capturing experiential learning and backward-looking search behavior, hill-climbing algorithms prevail (for overviews see Ganco and Hoetker 2009; Puranam et al. 2015; Baumann et al. 2019). Hill-climbing algorithms build on the metaphor of a landscape with multiple peaks where the hiker (decision-maker) is moving uphill in order to find a local or even the global peak of that landscape (Wright 1932; Altenberg 1997). In this metaphor, the “landscape” with its peaks and valleys represents, for example, the different levels (“altitudes”) of performance (e.g., profit) provided by the combination of choices of a multi-dimensional decision-problem. Employing local search, a particular feature of these algorithms is that a decision-maker would never accept or preserve performance-decreasing changes (e.g., Altenberg 1997; Russell and Norvig 2016). This feature of hill-climbing algorithms has been criticized regarding cognitive biases such as escalation of commitment, overconfidence, and confirmation bias (e.g., Staw 1981; Astebro et al. 2014; Mercier 2017). Based on experimental findings, Tracy et al. (2017) recently question that hill-climbing algorithms are appropriate representations of managerial search behavior. In a similar vein, Puranam et al. (2015) and Billinger et al. (2014) argue that, according to experimental results, decision-makers adapt their search behavior to performance feedback (e.g., broadening search when performance declines loom and vice versa). Thus, they conjecture, employing a problemistic search of boundedly rational agents as it was proposed by Simon (1955) and later by Cyert and March (1963) and Greve (2003) would lead to more realistic models (for a recent literature review, see Posen et al. 2018). In this vein, based on empirical and experimental support, it was suggested to study alternative algorithmic representations of managerial search and their effects on model behavior (e.g., Billinger et al. 2014; Puranam et al. 2015; Tracy et al. 2017).
The research presented here follows this line of argumentation. The starting point is the aforementioned body of research suggesting that a search algorithm reflecting problemistic search (i.e., allowing for adaptations to problem structure and performance feedback, [e.g., Cyert and March 1963; Greve 2003; Posen et al. 2018)] may be more appropriate to capture the search behavior of boundedly rational agents than hill-climbing algorithms. On this basis, the paper seeks to contribute to computational management science by proposing and exemplarily applying an alternative algorithm for representing managerial search behavior. In particular, the paper introduces an algorithm for experiential learning and backward-looking search for managers based on Herbert A. Simon’s concept of satisficingFootnote 1 (Simon 1955) which has turned out being a relevant representation of search human behavior (e.g., Güth 2010; Caplin et al. 2011). According to Simon, satisficing means searching sequentially for options until the decision-maker regards the level of utility achieved as satisfactory. The aspiration level shapes what is regarded as satisfactory. The aspiration level and the maximum number of options searched—depending on the difficulty of the decision problem to be solved—may be subject to adaptation.
Against this background, the paper has a twofold research objective:
-
1.
Introduction of an algorithm for managerial search behavior according to Simon’s satisficing concept;
-
2.
Exemplary application of the satisficing algorithm in contrast to hill-climbing algorithms in an agent-based simulation to showcase potential differences and commonalities regarding model behavior.
For this, the paper proceeds as follows: The next section provides an overview of the theoretical background with particular focus on Simon’s idea of satisficing, before, in Sect. 3, the algorithm capturing core elements of satisficing is introduced in the context of searching for superior solutions of combinatorial decision problems.
In Sect. 4, the proposed satisficing algorithm is contrasted to hill-climbing algorithms. This is done via the example of an agent-based simulation model of organizations operating on rugged performance landscapes. The performance landscapes are modeled according to the NK framework as initially introduced in evolutionary biology (Kauffman and Levin 1987; Kauffman 1993). The rationale for this choice is that many models dealing with search in organizations build on the NK framework (for overviews, e.g., Baumann et al. 2019; Wall 2016). Hence, the NK model serves as a kind of “quasi-standard” in research on managerial search. This makes the NK model a functional basis for the second research objective mentioned above. A particular feature of the NK model is that it allows to systematically vary the complexity of a search problem in terms of the interdependencies among its sub-problems (Li et al. 2006; Csaszar 2018) which makes it appropriate to study the search behavior for varying levels of difficulty to locate the global maximum. Hence, the illustrative agent-based simulation model presented controls for the level of intra-organizational complexity among subordinate managerial decision-makers. This is particularly relevant in view of the satisficing concept since the difficulty of finding satisfactory solutions drives adjustments, for example, of the aspiration level. The model is outlined in Sect. 4.
Section 5 introduces the experimental settings for the simulations. The simulations are conducted for purposes of explanation and prediction (Za et al. 2018; Burton and Obel 2011) with particular focus on the differences that satisficing vs. hill-climbing search entail for the model behavior. The results are presented and discussed in Sect. 6 followed by concluding remarks.
2 Search and saticficing: foundations and related work
2.1 Preliminary remarks on the theoretical background
In traditional schools of economic thinking, economic actors know, at least in principle, the entire space of solutions for their decision problems. Knowing the whole search space allows them to behave as utility maximizers, i.e., detecting and choosing that option out of the solution space which maximizes the respective utility function (von Neumann et al. 2007). Simon (1955) claimed that there is an “absence of evidence that the classical concepts describe the decision-making process” (p. 104). Among Simon’s arguments is that information gathering on options and their outcomes may not be costless.
However, the cost of search and information has been introduced taking a “classical economic perspective”. Stigler (1961) claimed that information on options often is not known in advance but has to be searched, and this may reasonably bring about search costs. Accordingly, in making the concept of utility maximizing more realistic, a decision-maker has to solve a sophisticated problem of economic choice: whether or not, to incur the search cost for better information which requires to forecast the information’s benefits (i.e., better choices) in terms of all its future consequences including subsequent choices. Yet, it has been argued that this extended “version” of the utility maximizing model, though economically stringent, does not capture real situations of decision-making for several reasons—among them principal problems of mathematical tractability or cognitive limitations (e.g., Conlisk 1996; Gigerenzer 2002, 2004). Gigerenzer (2002) argues that the rule to stop searching for information when the cost exceeds benefits (Stigler 1961) may paradoxically require more time, knowledge, and computational abilities of decision-makers (“sophisticated econometricians”) than in models with unbounded rationality.
2.2 Search in computational management science and its algorithmic representation
Against this background, a large body of research in computational management science, particularly in the vein of agent-based computational economics (Tesfatsion 2003; Chang and Harrington 2006; Chen 2012), is based on the concept of bounded rationality (Simon 1955, 1959). In particular, it is often assumed that economic agents do not dispose of a “theoretical” understanding of their problems, including knowledge of the solution space (an exception is Gavetti and Levinthal 2000); instead, agents have to search stepwise for superior solutions, e.g., solutions that provide better outcome with respect to the objective than the status quo (Safarzyńska and van den Bergh 2010). Hence, in computational models of search, instead of global optimization—with or without constraints imposed by the cost of information—agents often conduct experiential learning and backward-looking search. This is represented by local search, meaning that only one or some attributes of the current state (or policy) are changed; should this change be productive compared to the status quo, the modified policy serves as basis for a new local search. This results in adaptive processes. However, there is evidence that adaptive processes based on experiential learning are biased against new alternatives (e.g., Levinthal and March 1981; Levinthal 1997), especially since adaptation does not correct early sampling errors (hot-stove effect) (Denrell and March 2001).
With the shift from “instantaneous” global optimization to stepwise and local search also the processual perspective—including questions of speed of performance enhancements and of contingent factors—comes into play. In particular, the complexity of decision problems and environmental turbulence are among the predominant contingent factors in the respective stream of research. Computational studies on search behavior have been carried out in various domains like, for example, organizational design, innovation, psychology, and, accordingly, the related approaches in prior research are rather manifold. Overviews are, for example, given in Ganco and Hoetker (2009), Wall (2016), or Baumann et al. (2019).
In computational studies capturing backward-looking search behavior, greedy algorithms and, in particular, hill-climbing algorithms predominate. According to Cormen et al. (2009, p. 414), a “greedy algorithm always makes the choice that looks best at the moment” in terms of “a locally optimal choice in the hope that this choice will lead to a globally optimal solution”. A hill-climbing algorithm—employing the metaphor of seeking the highest summit (Wright 1932; Altenberg 1997)—for a move in the landscape requires that the outcome (“altitude”) will increase. In other words: the aspiration level is a performance improvement of greater than zero. For example, with a steepest ascent hill-climbing algorithm, that option out of more than one alternatives to the status quo is selected which provides the highest improvement in outcome; if none of the alternatives promises an incline in outcome, the status quo is kept. With this, hill-climbing algorithms are particularly prone to get stuck in local maxima, ridges, or plateaus in a landscape (for overviews, e.g., Cormen et al. 2009; Macken et al. 1991; Selman and Gomes 2006). This is mainly because with these algorithms a short-term decline in favor of a long-term increase would not happen since no choice in favor of an option that provides an inferior outcome than the status quo would ever be made. Hence, hill-climbing algorithms may lead to rather myopic search processes. Moreover, as mentioned in the Introduction, it was argued that this is also in conflict with some cognitive biases which indicate that decision-makers eventually behave in favor of performance declines. These considerations gave rise to questions whether hill-climbing algorithms appropriately capture managerial search behavior (e.g., Tracy et al. 2017).
While hill-climbing algorithms are customary in computational studies capturing managerial search processes, it is worth mentioning that they often serve just as the nucleus: in many models, managerial search is embedded in a broader context. This context is, for example, defined by the incentive schemes shaping managers’ objective functions and, thus, the particular “landscapes” managers are searching in (e.g., Siggelkow and Rivkin 2005; Wall 2017). Another contingency factor is the imprecision of managers’ information, which may, accidentally, lead to short-term declines but long-term inclines of performance choices (Knudsen and Levinthal 2007; Wall 2016). Furthermore, prior research studied the decomposition of the organizational decision problem (Dosi et al. 2003) or the coordination among managers searching on partitions of the overall decision problem (Siggelkow and Levinthal 2003). Moreover, the learning-based adaptation of the structure of search comes into play. For example, based on experience, the organization of search processes (e.g., who searches on which particular decision problem) could be subject to coevolution (e.g., Wall 2018).
However, as mentioned in the Sect. 1, the potential of hill-climbing algorithms to represent managerial search behavior has been questioned, and the very core of the research endeavor presented in this paper is to introduce and illustrate an algorithm for backward-looking search based on Simon’s concept of satisficing.
2.3 Simon’s concept of satisficing: outline and related work
This section intends to provide an overview of the “satisficing” concept with particular focus on an algorithmic representation for backward-looking search.Footnote 2 The following quote captures the core idea (Simon 1955, p. 110):
In most global models of rational choice, all alternatives are evaluated before a choice is made. In actual human decision-making, alternatives are often examined sequentially. We may, or may not, know the mechanism that determines the order of procedure. When alternatives are examined sequentially, we may regard the first satisfactory alternative that is evaluated as such as the one actually selected.
The satisficing concept is explained and justified extensively in Simon’s 1955 paper and subsequent works (e.g., Simon 1959, 1979; for a reconstruction of satisficing from Simon’s early works see Brown 2004). One of Simon’s arguments is that decision-makers endowed with limited information-processing capabilities may strive for decisions which are good enough with reasonable costs of computation (Simon 1955, p. 106; Simon 1979, p. 498). The quote above indicates on three building blocks which are particularly relevant for an algorithmic representation of satificing. These areFootnote 3:
-
1.
Sequential procedure, i.e., options are discovered and evaluated sequentially;
-
2.
Aspiration level, i.e., options are evaluated with respect to a level of outcome that is regarded satisfactory;
-
3.
Stopping rule, i.e., search is stopped when the first satisfactory option is found.
Regarding the stopping rule, Simon introduces further considerations to assure, first, that—at least in the long run—a satisfactory alternative can be found while, second, in the short-term, search can provisionally stop if no satisfactory alternative is identified. For this, in particular, he introduces a dynamic perspective by considering a sequence of situations with choices to be made (Simon 1955, p. 111):
The aspiration level, which defines a satisfactory alternative, may change from point to point in this sequence of trials. A vague principle would be that as the individual, in his exploration of alternatives, finds it easy to discover satisfactory alternatives, his aspiration level rises; as he finds it difficult to discover satisfactory alternatives, his aspiration level falls. Perhaps it would be possible to express the ease or difficulty of exploration in terms of the cost of obtaining better information about the mapping of A on S, or the combinatorial magnitude of the task of refining this mapping. There are a number of ways in which this process could be defined formally. Such changes in aspiration level would tend to bring about a ‘near-uniqueness’ of the satisfactory solutions and would also tend to guarantee the existence of satisfactory solutions. For the failure to discover a solution would depress the aspiration level and bring satisfactory solutions into existence. [emphasis in original]
As Simon points out, such a mechanism of adjusting aspiration levels assures that, satisfactory solutions exist in the long run. However, as mentioned before, a second aspect is the number of alternatives a decision-maker is willing to explore. In short-term, such an upper bound assures that the search, in principle, may stop even if no satisfactory option is found; however, in a sequence of situations, the maximum number of alternatives searched may be subject to adjustment too (Simon 1955, p. 111):
Up to this point little use has been made of the distinction between A, the set of behavior alternatives, and \({\dot{A}}\), the set of behavior alternatives that the organism considers. Suppose now that the latter is a proper subset of the former. Then, the failure to find a satisfactory alternative in \({\dot{A}}\) may lead to a search for additional alternatives in A that can be adjoined to \({\dot{A}}\).
Simon mentions these two types of adjustment—i.e., regarding aspiration levels and maximum number of options searched—as examples of how decision-making behavior could be adjusted to the perceived difficulty of finding satisfactory alternatives. Moreover, the two types of adjustments may substitute or complement each other (Simon 1955, p. 112):
In one organism, dynamic adjustment over a sequence of choices may depend primarily upon adjustments of the aspiration level. In another organism, the adjustments may be primarily in the set \({\dot{A}}\): if satisfactory alternatives are discovered easily, \({\dot{A}}\) narrows; if it becomes difficult to find satisfactory alternatives, \({\dot{A}}\) broadens... The more persistent the organism, the greater the role played by the adjustment of \({\dot{A}}\), relative to the role played by the adjustment of the aspiration level.
Hence, for an algorithmic representation, the above “list” of building blocks of satisficing could be extended by
-
4.
Adjustment of aspiration level, with downward (upward) adjustment when the decision-maker finds it difficult (easy) to identify a satisfactory alternative;
-
5.
Adjustment of maximum number of options explored, with broadening (narrowing) adjustment when the decision-maker finds it difficult (easy) to identify a satisfactory alternative.
The concept of satisficing stimulated a large body of further research in various domains reaching from psychology and economics to multi-agent systems (e.g., Bianchi 1990; Gigerenzer 2002; Todd and Gigerenzer 2003; Parker et al. 2007; Schwartz 2008; Rosenfeld and Kraus 2012).
For example, key elements of satisficing are among the foundations of “the adpative toolbox” comprising “fast and frugal heuristics” introduced by Gigerenzer (2002). Moreover, Simon’s satisficing provides a basis for Selten’s prominent “aspiration adaption theory” (Selten 1998, 2002). However, particularly the idea of aspiration levels has given rise to questions on how they are initially set and how they are updated (e.g., Bianchi 1990; Lant 1992; Güth 2007; Schwartz 2008). Another stream of research studies the effects of aspiration levels in an organizational context, especially of aspirations in terms of organizational targets (e.g., Mezias 1988; Washburn and Bromiley 2012; Joseph and Gaba 2015).
A further body of research seeks to test how far satisficing captures real human decision-making behavior empirically. For example, in an experimental study Caplin et al. (2011) find considerable support for key elements of satisficing behavior, namely sequential search and stopping a search process when a decision-maker regards the level of outcome satisfactory. Another stream of research refers to the conditions when decision-makers seek to behave as maximizers or satisficers, i.e., to styles of decision-making (e.g., Schwartz et al. 2002; Parker et al. 2007).
Close in spirit to these research efforts, is the paper of Billinger et al. (2014) which presents an experimental study on how decision-makers adapt their search behavior to the complexity of the decision problem. The authors find that search behavior is adjusted to subjective reference points of performance feedback which reflect success and failure. In particular, success narrows the search activities to local search; in contrast, failure promotes more distant search. Since the complexity of a decision problem makes it more difficult to find successful solutions, complexity affects the feedback that decision-makers receive in their search for new alternatives, leading to an adaptation of individuals’ search behavior (with further references, see also Puranam et al. 2015; Posen et al. 2018).
3 Algorithmic representation of satisficing managerial search behavior
3.1 Preliminary remarks
This section introduces a computational model of organizations with decision-making agents that employ satisficing in backward-looking search behavior following Simon’s concept as introduced in Sect. 2.3. The model is presented for decision-making agents facing a multidimensional binary decision problem. This modeling choice builds on two arguments:
First and most important, as it was outlined above, a considerable body of research in computational management science employs the prominent NK framework. In its standard form, the NK framework comprises N-dimensional binary bit strings as the vector of choices or features adapted throughout adaptive processes based on some kinds of learning or evolution. Hence, modeling the satisficing concept for binary decision problems eases the integration into prior research.Footnote 4
Second, a fixed dimensionality binary decision problem facilitates to model satisficing search behavior. For example, the maximum number of alternatives (see Sect. 2.3) and the term neighborhood can be figured out easily. However, the author believes that the simplifying assumption of binary decision problems does not limit, in principle, transferring the proposed algorithm of satisficing search behavior to other types of decision problems.
3.2 Process structure of satisficing search
Subsequently, satisficing search behavior of a manager r is described where manager r may be one out of M managers in an organization (i.e., \(r=(1,\ldots ,M)\)). Manager r faces an \(N^r\)-dimensional binary decision problem.
According to the behavioral assumptions of Simon (1955), manager r is not able to survey the entire search space and, hence, cannot “locate” the optimal solution of its decision problem “at once”. Instead, manager r employs a time-consuming search process to identify solutions with superior performance, or even the optimal solution, regarding manager r’s objective.
As outlined in Sect. 2.3, a particular feature of satisficing search behavior is that, when searching for superior performance, an agent may adapt the aspiration level and the maximum number of alternatives discovered before the agent decides to stop searching. Hence, the proposed model comprises three adaptive processes which are related to each other: In each period t of time,
-
1.
manager r sequentially searches for novel options to its particular decision problem within the institutional framework given which includes, for example, division of labor or rewards provided (Sect. 3.3);
-
2.
manager r adjusts the aspiration level \(a^r\) that a newly found option will have to meet to be selected in the next period based on the performance improvements resulting from the solutions implemented in the past (Sect. 3.4);
-
3.
manager r adjusts the maximum number \(s^{max,r}\) of options to be discovered before search is stopped depending on the number of options that manager r had to search for before a satisficing alternative was found in the past (Sect. 3.5).
Figure 1 shows the principle process of satisficing search behavior of a manager r. Subsequently, the model is described in more detail.
3.3 Sequential search for new options
A key feature of satisficing search is that new options are discovered and evaluated sequentially: the agent discovers one novel option \(\mathbf {d}^{s^r}_t\) and evaluates (i.e., searches for “cues” in the terminology of Simon (1955) whether it promises a performance improvement compared to the status quo \(\mathbf {d}^{r}_{t-1}\) that, at least, meets the aspiration level \(a^r(t)\), i.e., when
with
If so, this option is implemented, and search is stopped for this time step t; otherwise, the next option is searched and evaluated as far as the maximum number of options \(s^{max,r}(t)\) is not reached yet (see Fig. 1).
With manager r facing an \(N^r\)-dimensional binary decision problem, at maximum, \(2^{N^r}-1\) alternative configurations \(\mathbf {d}^{r}\) compared to the status quo could be implemented. Hence, the upper bound for the maximum number of options is
For an algorithmic representation of satisficing, defining a sequence of the agent’s discoveries of new options is necessary. For the sequence of options’ discovery, various possibilities are feasible. For example, one obvious way is to let the agent randomly discover one out of the \(2^{N^r}-1\) alternatives (if an option has been discovered before in that time step t, the random draw is repeated).
However, the simulation experiments presented subsequently employ a “closest-first” search policy which reflects the idea of neighborhood search: a manager r starts searching in the immediate “neighborhood” of the status quo. Should this not lead to a satisficing option, manager r extends the “circle” of search around the status quo. As outlined in Sect. 2.3, this modeling choice of the sequence is broadly supported by experimental evidence (Billinger et al. 2014; Puranam et al. 2015; Posen et al. 2018).
Hence, the sequence follows increasing Hamming distances of discovered alternatives to the status quo where the Hamming distance of an alternative option \(\mathbf {d}^{s^r}_t\) to the status quo is given by
Hence, the search starts with alternatives with a Hamming distance \(h(\mathbf {d}^{s^r}_t)=1\), then followed by options with a Hamming distance of two and so forth, as long as neither the aspiration level is met nor the maximum number of options \(s^{max,r}\) to be considered is reached. Among the options with equal Hamming distance the sequence is given at random.Footnote 5
The rationale for a sequence given by increasing Hamming distances is as follows: This sequence appears particularly appropriate to capture the idea of stepwise improvement of a given configuration. With respect to the cost of search and change, small steps (i.e., Hamming distance equal to 1) could be assumed to show lower cost than more distant options which require more changes. Hence, the “closest-first” search policy may be based on considerations of cost of search and change.
However, it is worth mentioning that other forms of the sequence of searching are arguable too: For example, a manager may be rewarded based on the particular novelty of the options chosen, which could give reason to start searching with the most distant alternatives possible.
3.4 Adaptation of the aspiration level
As mentioned in Sects. 2.3 and 3.2, a core element in satisficing is the aspiration level. Newly found options are evaluated according to whether or not they promise to meet the aspiration level, and the aspiration level is subject to adaptation based on experience (Simon 1955): The aspiration level may increase (decrease) depending on how easy (difficult) it was to find a satisfactory alternative in the past.
In the proposed model of satisficing search behavior, the aspiration level is adjusted according to the performance experience, i.e., an improvement or deterioration of performance (see Eq. 2) achieved over time. In particular, the aspiration level \(a^r(t)\) is captured as an exponentially weighted moving average of past performance changes where \(\alpha ^r\) denotes the speed of adjustment for manager r (Levinthal and March 1981; Böergers and Sarin 2000; Levinthal 2016), i.e.,
It is worth emphasizing that the aspiration level could also become negative—i.e., a performance decline becoming acceptable—if declines happened in the past. This establishes a contrast to hill-climbing algorithms where decision-makers would not accept performance declines (see Introduction). Section 6.1 comes back to this aspect.
3.5 Adaptation of the maximum number of options searched
In a similar vein, the space of options in which a manager searches for satisfactory alternatives may be dynamically adjusted. When it turns out to be difficult to find satisfactory options, the search space for alternatives is broadened; when finding satisfactory options is easy, search space is narrowed (Simon 1955). As mentioned in Sect. 2.3, this type of adjustment is supported by experimental results (Billinger et al. 2014).
In the modeling effort presented here, this is captured as adjustment of the maximum number \(s^{max,r}\) of options that the decision-making agent r may consider in the next time step. In particular, if in period t a maximum number of options, i.e., \(s^{r}(t)=s^{max,r}\), was searched and evaluated without that a satisfactory alternative to the status quo was identified, then for \(t+1\) the (potential) search space increases. For this, again, an exponentially weighted moving average of past search spaces is employed where \(\beta ^r\) denotes the speed of adjustment for manager r. Hence, the search space results from
However, since the maximum search space \(s^{max,r}\) has to be an integer, the moving average according to the upper case of Eq. 6 is to be rounded up or down which is done according to
Hence, with Eq. 7, the “adjusting” procedure in Eq. 6 does not necessarily result in an adjusted space \(s^{max,r}(t+1)\) of options for the next period.
4 Example: an agent-based model of search in collaborative organizations based on satisficing vs. hill-climbing
4.1 Overview
This section intends to contrast, employing an example, the adaptive walks of organizations with satisficing managers to organizations with managers employing a hill-climbing algorithm as familiar in the domain of agent-based computational organization science.
To this end, the model captures artificial organizations that use only a few organizational design elements. In particular, organizations do not comprise more elements than the decomposition of an overall decision problem into sub-problems, the delegation of sub-problems to decision-making managers, and a headquarters rewards managers according to their respective performance. One may think of the decision-making managers as being the heads of respective departments. However, in terms of the concept of Agent Action Diagrams (AAD) proposed by Li et al. (2019), the model explicitly only captures the “thinking” parts of the managers’s actions (e.g., searching and evaluating of options, decision-making); the “command” and “execution” actions related to the choices made are not explicitly modeled; rather, it is assumed that these actions are put forward according to the choices.Footnote 6 The Appendix outlines the agent entity organization diagram and the agent entity action attribute diagram according to the AAD concept (Li et al. 2019).
Hence, the model captures simple—for not to say: simplistic—organizations which refrain from many design elements as, for example, more sophisticated coordination mechanisms like organizational routines (e.g., Gao et al. 2018; Gao and Akbaritabar 2021) or team-based compensation (e.g., Siggelkow and Rivkin 2005; Wall 2017) which were incorporated in many agent-based models.
The rationale for this is to study the effects of the satisficing compared to the hill-climbing algorithm on managerial search in organizations in as pure a form as possible and without being entangled with organizational mechanisms that are intended to align individual behavior with organizational objectives.
In the following, first, the overall organizational decision problem, its decomposition, and delegation to managers are introduced (Sect. 4.2). Next, a description of managers’ objective functions and information basis (Sect. 4.3) follows. Third, search and decision-making via hill-climbing are briefly outlined in contrast to satisficing (Sect. 4.4).
4.2 Decision problem and structure of the organizations
In the simulation model, artificial organizations are observed while searching for superior solutions for a decision problem according to the framework of NK-fitness landscapes (Kauffman and Levin 1987; Kauffman 1993). In particular, at each time step t the organizations face an N-dimensional binary decision problem, i.e., \(\mathbf {d_t}={(d_{1t},...,d_{Nt})}\) with \(d_{it}\in \left\{ 0,1\right\}\), \(i=1,...,N\), out of \(2^{N}\) different binary vectors possible. Each of the two states \(d_{it}\in \left\{ 0,1\right\}\) provides a distinct contribution \(C_{it}\) to the overall performance \(V(\mathbf {d_t})\). The contributions \(C_{it}\) are randomly drawn from a uniform distribution with \(0\le {C_{it}}\le 1\). Parameter K (with \(0\le K\le N-1\)) reflects the number of those choices \(d_{jt}\), \(j\ne {i}\) which also affect the performance contribution \(C_{it}\) of choice \(d_{it}\). Hence, K captures the complexity of the decision problem in terms of the interactions among decisions: this means that contribution \(C_{it}\) may not only depend on the single choice \(d_{it}\) (being 0 or 1) but also on K other choices:
with \(\left\{ i_1,...,i_K\right\} \subset \left\{ 1,...,i-1,i+1,...,N\right\}\). In case of no interactions among choices, K equals 0, and K is \(N-1\) for the maximum level of complexity where each single choice i affects the performance contribution of each other binary choice \(j\ne i\). The overall performance \(V_t\) achieved in period t results as normalized sum of contributions \(C_{it}\) from
The organizations have a hierarchical structure and comprise two types of agents: (1) one headquarter and (2) M managers. The organizations make use of division of labor. In particular, the N-dimensional overall decision problem is decomposed into M disjoint partial problems, and each of these sub-problems is exclusively delegated to one manager \(r=(1,\ldots ,M)\). For the sake of simplicity, the sub-problems are of equal size \(N^r\).Footnote 7 Each manager r is endowed with decision-making authority on its “own” partition of the organization’s decision problem.
The headquarter seeks to maximize the overall performance \(V_{t}\) according to Eq. 9. However, its role is restricted to—at the end of each time step t—observing the overall performance \(V_{t}\), observing each manager’s performance contribution and rewarding managers accordingly.
Depending on the complexity K of the N-dimensional decision problem and the particular structure of interactions among the M sub-problems, indirect interactions among the managers’ choices may result. Let \(K^{ex}\) denote the level of interdependencies across managers’ sub-problems. In case that interdependencies across sub-problems exist, i.e., if \(K^{ex}>0\), then the performance contribution of manager r’s choices to overall performance V is affected by choices made by other managers \(q \ne r\) and vice versa (see, for example, Fig. 2b).
4.3 Managers’ objective functions and information
The managers seek to maximize compensation which is merit-based and depends on the performance contribution \(P^{r}_{t}(\mathbf {d_t})\) of manager r to overall performance \(V(\mathbf {d_t})\) according to Eq. 9. Hence, we have
with
and with \({w}={\sum ^{r-1}_{m=1} {N^m}}\) for \(r>1\) and \({w}=0\) for \({r}=1\).
For the sake of simplicity, compensation of manager r depends linearly on the value base \(P^{r}_{t}(\mathbf {d_t})\) for all levels of \(P^{r}_{t}\). Hence, by increasing the performance contribution \(P^{r}_{t}\) of the partial solution for the \(N^r\)-dimensional sub-problem to the overall organization’s decision-problem, manager r also increases its compensation.
However, when making choices on their respective partial configurations \(\mathbf {d^r_t}\), the managers show some further cognitive limitations (apart from not knowing the entire space of solutions and, thus, having to search for options):
First, manager r cannot anticipate the other managers’ \(q\ne {r}\) choices; rather manager r assumes that the fellow managers will stay with the status quo, i.e., opt for \(\mathbf {d}^{q*}_{t-1}\). This is since, at the the beginning of every period, manager r gets knowledge of the solution \(\mathbf {d}^{*}_{t-1}\) to the overall decision problem of the organization and, hence, of the fellow managers’ choices. However, the model does not capture any further mutual perceptions among managers (e.g., sympathy, social identification) between managers (e.g., Smarzhevskiy and Solovev 2020; Ellemers et al. 2004).
Second, manager r is not able to perfectly ex-ante evaluate the effects of any newly discovered option \(\mathbf {d}^{s^r}_t\) on the value base for compensation \(P^{r}_{t}(\mathbf {s^d}^{r}_t)\) (see Eq. 11). Rather, ex ante evaluations are afflicted with noise which is, for the sake of simplicity, an relative error imputed to the actual performance (Wall 2010; for further types of errors see Levitan and Kauffman 1995; for further models of managerial search capturing imperfect evaluations see Carley and Zhiang 1997; Chang and Harrington 1998; Knudsen and Levinthal 2007). The error terms \({e}^{r}(\mathbf {d}^{s^r}_t)\) follow a Gaussian distribution \(N(0;\sigma )\) with expected value 0 and standard deviations \(\sigma ^{r}\); errors are assumed to be independent from each other. Hence, the value base of compensation \({\tilde{P}}^{r}_{t}(\mathbf {d}^{s^r}_t)\) of a newly discovered \(\mathbf {d}^{s^r}\) option as ex ante perceived by manager r is
Thereby, when making decisions, each manager r has a different “view” of the actual fitness landscape which results from (1) the decomposition of the overall decision problem and the delegation of sub-problems and (2) from the managers’ individual “perceptions” due to the individualized error terms \(\sigma ^{r}\). However, for the status quo option \(\mathbf {d}^{r*}_{t-1}\), it is assumed that manager r remembers the compensation from the last period. From this, manager r also knows the actual performance \({P}^{r}_{t}\) of the status quo, should the manager choose to stay with it in time step t and if, in case of interactions across sub-problems, also the fellow managers stay with the status quo.
4.4 Search strategies
In every time step t, each manager r seeks to identify a superior configuration for its partial decision problem \(\mathbf {d^{r}_{t}}\) with respect to the value base of compensation. The search strategy shapes the options a manager can choose. The simulation model contrasts adaptive walks of organizations with satisficing managers to those organizations with managers employing a steepest ascent hill-climbing algorithm as frequently employed in computational management science. In Sect. 3, the model of satisficing search was introduced. Hence, at this point, a short outline of hill-climbing in the context of the model follows.
In particular, as already mentioned, in the model the managers cannot survey the entire search space and, hence, they have to search stepwise for superior solutions. Following a hill-climbing algorithm, a manager searches in the neighborhood for a fixed number \(s^{max,r}\) of alternatives and opts for an alternative only if it promises a higher performance (“fitness”) than the status quo. The distance to the status quo defines the term neighborhood and, in the context of the NK-model, is measured by the Hamming distance \(h(\mathbf {d}^{s^r}_t)\) of an alternative option to the status quo \(\mathbf {d}^{r}_{t-1}\) according to Eq. 4.
In the most simple case, the neighborhood is set to \(h(\mathbf {d}^{s^r}_t)=1\) and the number of alternatives is \(s^r = s^{max,r}=1\), too. This means that only one alternative to \(\mathbf {d}^{r}_{t-1}\) is discovered where—usually at random—one bit is flipped. However, the “allowed” neighborhood of search could be broader than one, and also the number of alternatives the manager identifies could be higher than one. Both is often employed in models of organizational search (e.g., Siggelkow and Rivkin 2005; Wall 2017; for overviews Chang and Harrington 2006; Baumann et al. 2019). If the number of alternatives \(s^{max,r}\) identified providing a performance incline is higher than one, that option with the highest incline is selected (steepest ascent hill-climbing). Hence, three aspects of this hill-climbing algorithm (HCA) appear noteworthy in comparison to the satisficing algorithm (see Sect. 3):
-
1.
In the HCA, the number \(s^r\) of newly discovered alternatives per period equals the maximum number of alternatives allowed, i.e., \(s^r = s^{max,r}\). Moreover, the maximum number of alternatives is not subject to adaptation based on experience over time like in satisficing.Footnote 8
-
2.
The HCA employs an aspiration level of zero: alternatives with a performance incline compared to the status quo are worth being selected by a manager (i.e., \(a^r > 0\)). Additionally, unlike in satisficing, the aspiration level is not adapted according to experience.Footnote 9
-
3.
In the HCA, options are not searched and evaluated in sequence with a stop of searching when an alternative meets the aspiration level like in satisficing. Instead, in case that the HCA is parametrized to two or more alternatives to be searched (i.e., if \(s^{max,r}>1\)), the search stops when \(s^{max,r}\) alternatives are identified. Then, these \(s^{max,r}\) options are evaluated against the status quo and against each other to figure out the steepest ascent.
The paper presents the results of simulations for organizations with managers employing satisficing versus hill-climbing search. For this, the next section introduces the particular parameter settings of the simulation experiments.
5 Simulation experiments and parameter settings
The simulation study seeks to provide insights into how satisficing managerial search behavior compared to hill-climbing behavior affects the organizations’ resulting adaptive walks. Table 1 displays the parameter settings which are explained in the remainder of this section.
The parameter settings in the upper part of Table 1) apply to experiments with both satisficing and hill-climbing types of managers. As such, organizations are observed for 250 periodsFootnote 10 when searching for superior solutions to an \(N=12\)-dimensional decision problem. The overall decision problem is decomposed into \(M=4\) equal-sized sub-problems of which each is delegated exclusively to a subordinate manager. The Appendix provides a graphical description following the “agent action diagram (AAD)” approach (Li et al. 2019).
The experiments are conducted for different levels of complexity of the organizations’ decision problems: In particular, the organizations may have a perfectly decomposable interaction structure of decisions which captures situations where, for example, the task of an organization is perfectly decomposable along geographical regions or products without any interdependencies across regions or products, respectively (Galbraith 1974; Rivkin and Siggelkow 2007; Simon 1962). Figure 2a gives an example of a situation with no interactions across managers’ sub-problems (i.e., \(K^{ex}=0\)). Alternatively, the interaction structures captured in the experiments may exhibit a low, medium, or high level of interactions across sub-problems. For example, Fig. 2b shows a case of a high level of cross-problem interactions (i.e., \(K^{ex}=5\)). This interaction structure may represent situations caused by certain constraints of resources (budgets or capacities), by market interactions (prices of one product may affect the price of another) or functional interrelations (e.g., the product design sets requirements for logistics or procurement processes) (Thompson 1967; Galbraith 1973; Rivkin and Siggelkow 2007; Li et al. 2021).
When ex-ante evaluating newly discovered options, the managers suffer from some noise (see Eq. 12) following a Gaussian distribution with mean 0 and a standard deviation of 0.05. This parametrization intends to reflect some empirical evidence according to which error levels around 10% could be a realistic estimation (Tee et al. 2007; Redman 1998).
Regarding experiments for organizations resided by satisficing managers (see middle part of Table 1), the aspiration levels of performance enhancements start at a level of zero for two reasons: first, this corresponds to hill-climbing (see Sect. 4.4) and, hence, eliminates one source of potential differences between the two modes in the experiments. Second, this “conservative” setting captures the desire to avoid, at least, situations of not-sustaining an already achieved performance level. For satisficing search, the maximum search space starts at a moderate level of just two alternatives, which also relates to a search space often specified for hill-climbing search in computational management science. Regarding the speed of adjustment for both the aspiration level of performance enhancements and the maximum number of alternatives, the present observation and the past are weighted equally with \(\alpha ^r\) (Eq. 5) and \(\beta ^r\) (Eq. 6), respectively, set to 0.5.
The simulations experiments comprise two different steepest ascent hill-climbing strategies (see the lower part of Table 1). In particular, in the “HC2”-strategy, in every time step, each manager discovers two alternatives to the respective status quo, each alternative with one bit flipped compared to the status quo and thus captures local search. With the “HC6”-strategy, in every time step, 6 alternatives to the current configuration are discovered, i.e., 3 with Hamming distance 1 and 3 with Hamming distance of 2.Footnote 11
The HC2-strategy corresponds to agent-based models in prior research which study local search—often in comparison to other forms of search (e.g., Levinthal 1997; Jain and Kogut 2014)—and, thus, serves as a basis for comparisons of simulation results obtained with satisficing agents. In contrast, the HC6-strategy serves another purpose in the experiments: it captures a kind of “upper bound” of feasible partial alterations given the overall decision-problem of the size \(N=12\) and its decomposition into four equal-sized sub-problems. Hence, the HC6-strategy provides a broad search space and an obvious question is whether search spaces in satisficing (see Eq. 6) may evolve to the same high level.
6 Results and discussion
In order to be clear and concise in exploring the parameter space, the results of the simulation experiments are presented in two steps. Following the idea of factorial design of simulation experiments (Lorscheid et al. 2012), Sect. 6.1 introduces results of two baseline scenarios to analyze the principal effects of satisficing vs. hill-climbing managerial search behavior. In particular, organizations facing a decomposable decision-problem (i.e., \(K^{ex}=0\)) and organizations which have to deal with a medium level of complexity (i.e., \(K^{ex}=3\)) are studied. Section 6.2 provides an analysis of the sensitivity to intra-organizational interactions for a broader range of complexity levels of the organization’s decision-problem.
6.1 Baseline scenarios
Table 2 reports condensed results obtained from the simulation experiments for the baseline scenarios. For each scenario (i.e., combination of interaction structure and search strategy), the respective 2500 simulation runs were analyzed with respect to several metrics.Footnote 12
The performance change achieved on average in the first ten periods informs about the speed of performance enhancement at the beginning of the adaptive walks, which may be particularly relevant in turbulent environments (Siggelkow and Rivkin 2005). However, with respect to satisficing, the usually high performance inclines at the beginning of search are particularly interesting for the adjustments of aspiration levels and search spaces. The final performance, i.e., performance \(V_{t=250}\) achieved in the last period of the observation time on average in the 2500 simulation runs per scenario, informs about the effectiveness of the search processes. This also applies to the relative frequency of how often the global maxima in the respective performance landscapes have been found in the 2500 simulation runs per scenario. The ratio of periods in which a new configuration \(\mathbf {d_t}\) is implemented characterizes the adaptive walks more into detail.
Figure 3 plots the performance levels obtained in the course of adaptive walks over time for each scenario. Figure 4 displays the adaptation of aspiration levels over time for the two satisficing scenarios. Please, recall, in the hill-climbing scenarios, aspiration levels are zero (see Sect. 4.4), which is why they are not plotted. Figure 5 reports on the adjustment of the search spaces in satisficing search for the decomposable and the non-decomposable structure; the search spaces of the scenarios employing hill-climbing are fixed, as is also indicated in the figure.
The following discussion of results mainly focuses on satisficing search behavior in contrast to the hill-climbing strategies (and less on comparing the hill-climbing modes against each other).
The plots in Fig. 3 indicate that the performance enhancements obtained via satisficing search behavior are at medium levels compared to the two hill-climbing modes (which, however, perform differently well in the two interaction structures) for both interactions structures under investigation. The results reported in Table 2 also suggest that satisficing search is at medium levels regarding initial performance enhancements and final performances. For the frequency of global maximum found, satisficing search outperforms both hill-climbing models in the non-decomposable structure. With satisficing, the ratio of periods with altered configurations is at a notably high level compared to the HC2 strategy. In the case of a decomposable structure, it even exceeds the level of the HC6 strategy. For a closer analysis of results, it appears helpful also to consider the adjustment processes of the aspiration levels and maximum search spaces as plotted in Figs. 4 and 5, respectively.
6.1.1 Decomposable interaction structure
Each manager faces a partial binary problem in the decomposable interaction structure without any interactions among the managers’ problems existing. Hence, the organization’s overall performance maximum could be found by identifying the sub-problems’ optimal solutions. Therefore, with a broad search space enabled at the managers’ site as with the HC6-strategy, it is not surprising that the adaptive walks quickly reach performance levels close to the maximum of 1. With managers employing satisficing behavior, the performance levels achieved are close to that of the HC6-strategy. Moreover, the maximum number of alternatives per manager increases rather quickly to nearly the high level of 6 as fixed for the HC6 strategy and remains at this high level (see Fig. 5).
The explanation for this is as follows: in the decomposable structure, managers likely find configurations with high or even the maximum performance level for their partial problem. However, from a very high (or maximal) performance level, it becomes more difficult (or impossible) to further increase performance. However, according to the behavioral assumptions underlying the idea of bounded rationality (Simon 1955), the managers are not aware of whether they already have identified the optimal solution. In consequence, since managers experience it difficult to further increase performance, according to the satisficing concept, the search space is increased. It remains at a high level in the—potentially futile—attempt to increase performance further.
The adaption of aspiration levels follows an inverse adjustment: After a high incline in the first periods—due to high inclines of performance at the beginning—the aspiration levels decline quickly to a level of zero: with being close to the best configuration (or having it found already), further performance enhancements are unlikely (or even impossible) and, hence, the aspiration levels of decision-making managers, persistently, remain at a level of zero. However, a closer analysis reveals that the aspiration levels oscillate closely around zero. This is because the managers in the model are not capable of evaluating options perfectly. Hence, false-positive choices may occur, which then affect the aspiration levels and may turn them to the negative (see Eq. 5).Footnote 13
6.1.2 Non-decomposable interaction structure
In the non-decomposable structure, the link between managers’ sub-problems and the overall decision-problem is more complicated than in the decomposable case for two reasons. First, when searching for superior solutions to their partial decision problems, managers do not necessarily increase overall performance. Hence, maximizing parochial performance and the overall performance of the organization may conflict with each other. The second reason refers to managers’ cognitive limitations regarding their fellow managers’ choices. Due to interactions among sub-problems, manager r’s choice for the partial problem \(\mathbf {d}^r\) may affect the performance \(P^{q}_{t}(\mathbf {d^q_t})\) (Eq. 11) of another manager \(q\ne r\) and vice versa. Since, in the model, the managers notice their fellow managers’ choices with one period of delay, this may lead to frequent, time-delayed mutual adjustments in order to keep up with the fellow managers’ choices, which again induces mutual adjustments and so forth. These considerations reflect the lower performance levels achieved, the lower frequencies of the global maximum found, and the higher ratios of altered configurations compared to the decomposable structure reported in Table 2. These results, in principle, correspond to prior research employing computational models of organizations (e.g., Carley 1992; Rivkin and Siggelkow 2007; Siggelkow and Rivkin 2005). However, the differences across search strategies are remarkable, which is analyzed in more detail in Sect. 6.2.
In the satisficing strategy, the adjustments of maximum search spaces and aspiration levels deserve closer inspection. Regarding the adjustment of maximum search spaces in the satisficing strategy (Fig. 5), for the non-decomposable structure, we again notice an increase over time—though up to a lower level of about 5 per manager and with a lower gradient compared to the decomposable structure. This may result from the following effects: as argued above, in non-decomposable structures, it is rather difficult to identify solutions that induce performance enhancements. However, when finding promising options becomes more difficult, with satisficing the maximum search space is increased. At the same time, this may counteract the peril of sticking to local maxima, and the peril of inertia is the more pronounced, the higher the complexity K (or \(K^{ex}\)) of a decision-problem.Footnote 14
The adjustment of aspiration levels plotted in Fig. 4 shows the inverse development, and aspiration levels decline over time. Contrary to the decomposable case, now the aspiration levels oscillate remarkably around a level of zero. Hence, an interesting question is what may cause these oscillations. Like in the decomposable structure, the imperfect evaluations contribute to oscillations of aspiration levels: imperfect ex-ante evaluations may lead to performance declines due to false-positive choices. Accordingly, these “negative” experiences are reflected in the adjustment of the aspiration levels. Additionally, in the non-decomposable structure, interactions among sub-problems combined with cognitive limitations regarding the choices of fellow managers further induce oscillating aspiration levels:
-
1.
When making their choices in time step t, decision-makers assume that their fellow managers stay with the status quo. This is particularly problematic in case of interactions among decision-problems and may cause “surprises” and, in consequence, frequent mutual adjustments (which happens in about 60% of periods, see Table 2);
-
2.
The actual choices of fellow managers are revealed only at the end of period t which causes a time-delay in the aforementioned mutual adjustments to the other managers’ choices;
Hence, due to interactions combined with alterations by fellow managers, performance declines may happen which reduce aspiration levels even below zero.
In sum, intra-organizational complexity in combination with imperfect information in decision-making reasonably causes frequent alterations of configurations \(\mathbf {d}\) and oscillations of aspiration levels in the satisficing strategy. We return to this aspect in Sect. 6.2. Taking a more general perspective on the baseline scenarios, one may summarize the findings in the following hypotheses:
-
(1)
Organizations in which decision-makers with satisficing search behavior reside and which already have identified configurations providing high levels of performance are likely to employ extensive search and aspiration levels which enforce to (just) maintain the performance.
-
(2)
Intra-organizational complexity combined with cognitive limitations of decision-makers showing satisficing search behavior induces high levels of search activity and oscillating aspiration levels.
These hypotheses could be related to organizations’ maturity, and organizational learning in terms of both performance level achieved and organizations’ focus on searching for novel solutions.
6.2 Sensitivity to intra-organizational complexity
The next step of analysis considers simulation results for all levels of intra-organizational complexity from \(K^{ex}=(0,\ldots, 5)\). Thereby, we intend to provide more detailed insights into potential differences of satisficing behavior compared to hill-climbing search. For this, Fig. 6 displays—for the three search strategies under investigation—(a) the performance level achieved on average of 2500 runs in the last period of observation, (b) the relative frequency of runs in which the global maximum was found in the last period, and (c) the average ratio of periods in which the organizations implement a new solution to their decision problem.
The results reveal that, for all search strategies, the final level of performance decreases with increasing intra-organizational complexity, which is broadly in line with prior research (e.g., Rivkin and Siggelkow 2007; Levinthal 1997). However, as shown in Fig. 6a, the search strategies are differently sensitive to an increase in intra-organizational complexity. The HC2-strategy—allowing only two alternatives and with just 1-bit changes each—is comparably robust with about 8.5 percentage points (p.p.) between highest and lowest final performance. In contrast, this difference is about 25 p.p. with satisficing and 34 p.p. with the HC6-strategy. Hence, these strategies—allowing for more alternatives considered and longer jumps—are notably sensitive to intra-organizational complexity in terms of performance declines.
These results might be counter-intuitive since one may expect that search strategies allowing to consider more alternatives and making even longer jumps outperform the HC2-strategy since this strategy is much more “restrictive” regarding search space and extent of change. Moreover, concerning satisficing the result is particularly interesting: with this strategy, the decision-makers sequentially discover and ex-ante evaluate alternatives—and this with increasing Hamming distances starting with two options with 1-bit changed. Hence, intuition may suggest that satisficing should not perform worse but even more successfully than the HC2-strategy. Moreover, it is worth mentioning that the satisficing strategy tends to show higher ratios of locating the optimal solution as Fig. 6b suggests.
The more extensive search spaces and longer jumps employed in satisficing—and likewise with the HC6-strategy—result in a remarkable increase in alterations as shown in Fig. 6c. For example, for high intra-organizational complexity (\(K^{ex}=5\)), with satisficing in about 83% of the periods and with HC6 hill-climbing in almost every period, another solution for the overall decision problem is implemented.
An interesting question is what causes these effects. The explanation may lie in the destabilization of the search when the strategy allows for more alternatives and long jumps as is the case with satisficing and HC6 hill-climbing. In particular, interactions among managers’ sub-problems and imperfect information at the managers’ site subtly interfere. Each manager \(r=(1,\ldots, M)\)—when making its decision in t without knowing what the fellow managers intend to do—may not only have been surprised by the actual performance \(P^r\) achieved in \(t-1\). Moreover, the fellow managers’ choices in \(t-1\) which—due to intra-organizational interactions—have affected r’s performance in \(t-1\) may be another source of surprise for manager r. This eventually lets manager r adapt configuration \(\mathbf {d}^r_{t}\) and so forth—resulting in frequent time-delayed mutual adjustments. Hence, search behavior that is more flexible in terms of more options and longer jumps makes it more likely that a manager discovers alternatives that (eventually falsely) promise to increase r’s performance. In this sense, the flexibility of search may induce some harmful “hyperactivity” of searching when intra-organizational complexity increases. The ratios of alterations increasing in the intra-organizational complexity with satisficing, or HC6 provide support for this conjecture (Fig. 6c).
These considerations may be summarized as follows: Search behavior that is more flexible in terms of considering a higher number of options and longer jumps as captured in satisficing is more prone to destabilizing (“hyperactive”) mutual adjustments than more restrictive forms of search behavior.
As mentioned before, prior research often employs algorithms like our HC2-strategy to represent local search for superior solutions to organizations’ overall decision problem. In doing so, prior research puts considerable emphasis on complexity, i.e., interactions within the overall decision problem. The sensitivity analysis presented here suggests that satisficing search is remarkably more sensitive to intra-organizational complexity than local search via hill-climbing. This appears particularly relevant since satisficing has received considerable support in behavioral experiments (see Sects. 1 and 2.3), thus, maybe a more realistic computational representation of managerial search behavior than hill-climbing algorithms.
7 Conclusion
At the center of this paper are the questions of representing managerial search behavior in computational models and how the representation may affect models’ results. Prior research questions that hill-climbing algorithms—predominating in computational organization science—represent managerial search behavior appropriately. At the same time, there is considerable empirical and experimental evidence on the relevance of satisficing behavior in actual human behavior. This serves as a starting for this paper which makes two contributions.
First, the paper introduces an algorithmic representation for backward-looking search according to Simon’s concept of satisficing (Simon 1955). The satisficing algorithm may complement other models of managerial search in (agent-based) computational organization science and, in this sense, may contribute to the ongoing discussion on how to model human decision-makers (e.g., Gode and Sunder 1993; Chen 2012; Hommes 2006; Billinger et al. 2014; Puranam et al. 2015; Posen et al. 2018).
Second, in an agent-based simulation model of decision-making organizations, the proposed algorithm of satisficing is applied and contrasted to the steepest ascent variant of hill-climbing. Apart from decision-makers’ incomplete knowledge of the solution space, the model captures further aspects of bounded rationality. The simulation experiments suggest that, first, with satisficing for organizations already operating at a high performance level intense search activities may emerge. Second, oscillating aspiration levels (including accepting performance declines) and potentially destabilizing search activities may occur when intra-organizational complexity is high. Third, a sensitivity analysis reveals that satisficing is considerably more sensitive to intra-organizational complexity in terms of performance declines than hill-climbing algorithms.
In sum, from a more general perspective, the results suggest that the type of search algorithm the decision-making agents employ (i.e., whether they follow the satisficing concept or a hill-climbing approach) may subtly shape the model’s behavior. These findings may shed some new light on prior modeling efforts building on hill-climbing algorithms, and may even suggest to revisit the respective computational studies in future research efforts (in a similar vein, Tracy et al. 2017).
The simulations presented in this paper require relativizing remarks, which also point to future research activities. First of all, it has to be emphasized that the satisficing concept captures some more modeling choices and parameter settings than typically showing up for hill-climbing. This applies particularly to the search sequence and the adjustments of aspiration levels and of the maximum number of options. For example, the simulations presented in this paper assume a “closest first” sequence of search and exponential weighting with equal focus on past and presence for the adjustments “built-in” in the satisficing concept. While there is considerable experimental evidence on this proximity-driven “closest first” sequence, other types of sequence may apply depending on the particular context (Puranam et al. 2015; Greve 2003). Moreover, there is a considerable body of empirical and experimental research on aspiration levels, and especially on the interrelation of organizational and individual aspirations in conjunction with the adjustments of aspirations (e.g., Lant 1992; Lant and Shapira 2008; Washburn and Bromiley 2012; Joseph and Gaba 2015). Hence, an obvious further step would be to explore the effects of satisficing on model behavior for a broader parameter space with a particular focus on the reference to the reality of managerial decision-making.
Moreover, the simulation model introduced in this paper captures relatively simple—for not to say: simplistic—organizations. In particular, the organizational arrangements do not comprise much more than the division of labor (i.e., decomposition into sub-problems and delegation to subordinate managers) and a simple incentive scheme that rewards parochial performance. Hence, an interesting question is how satisficing search behavior shapes results for organizations with more sophisticated institutional arrangements. Of interest may be, for example, how different coordination mechanisms could counteract the destabilizing effects of satisficing in the case of higher levels of intra-organizational complexity compared to hill-climbing. Studying the satisficing algorithm in models of more sophisticated organizational arrangements will, on the one hand, contribute to linking this representation of managerial search behavior to prior research in computational organization theory; on the other hand, it would allow studying the effects of satisficing in more realistic models of organizations.
Notes
The term satisficing results from a merger of the two words: satisfying and sufficing in the sense that in the process a solution is found which is both satisfying and sufficing (Hoffrage and Reimer 2004).
In Sect. 4, this paper presents simulation experiments for managerial decision-makers within organizations. The organizations face a binary decision problem according to the NK-framework. The subsequent description of satisficing search behavior follows some notational conventions of the NK framework.
For example, with a decision problem of \(N^r=3\), three alternatives to the status quo with a Hamming distance \(h(\mathbf {d}^{s^r}_t)=1\), three alternatives with \(h(\mathbf {d}^{s^r}_t)=2\) and one with \(h(\mathbf {d}^{s^r}_t)=3\) exist. A manager first discovers nearest neighbors; next, options with \(h(\mathbf {d}^{s^r}_t)=2\) are found etc. where the sequence among equal-distanced options is randomly given.
The likewise applies to the headquarters’ actions: The model does not explicitly capture the headquarters’ “thinking” actions (i.e., decomposition of the overall task, observation of the status quo), the “command” actions (i.e., delegation of sub-problems) and execution action (i.e., rewarding) which is why the they are not further documented in the Appendix.
With \(N \in {\mathbb {N}}\) this requires that N is divisible by M without remainder.
In this sense, the HCA could be regarded as a special case according to Eq. 6 with \(\beta =0\).
With this, the HCA may be regarded as a special case according to Eq. 5 with \(\alpha =0\).
The observation period T was fixed based on pretests which indicate that the results do not principally change for longer observation periods.
Hence, in the HC6-strategy each manager r identifies 6 out of the 7 possible alternatives to the \(N^r=3\)-dimensional partial decision problem, see fn. 5. The only option that is not feasible is switching each bit of the 3-dimensional sub-problem of each manager. The space of alternatives considered could also be regarded as indication on managers’ capabilities as in Rivkin and Siggelkow (2003).
In the analysis of simulation experiments, the metrics related to performance \(V_t\) are given relative to the global maxima of the respective performance landscapes: otherwise, the results were not comparable across different performance landscapes.
In other words: When decision-makers can evaluate options perfectly, in the decomposable structure, the aspiration levels do not oscillate around zero. Instead, after an initial incline, they decline to equal zero.
However, the search space is at a lower level than with satisficing in the decomposable structure. This is because, in the decomposable structure, high levels of performance are found very quickly. With this, further performance enhancements are difficult to achieve, which leads to an extension of the search space close to the upper bound.
References
Altenberg L (1997) Section B2.7.2: NK fitness landscapes. In: Back T, Fogel D, Michalewicz Z (eds) The handbook of evolutionary computation. Oxford University Press, Oxford, pp B2.7:5–B2.7:10
Astebro T, Herz H, Nanda R, Weber RA (2014) Seeking the roots of entrepreneurship: insights from behavioral economics. J Econ Perspect 28(3):49–70
Baumann O, Schmidt J, Stieglitz N (2019) Effective search in rugged performance landscapes: a review and outlook. J Manag 45(1):285–318
Bianchi M (1990) The unsatisfactoriness of satisficing: from bounded rationality to innovative rationality. Rev Polit Econ 2(2):149–167
Billinger S, Stieglitz N, Schumacher TR (2014) Search on rugged landscapes: an experimental study. Organ Sci 25(1):93–108
Böergers T, Sarin R (2000) Naive reinforcement learning with endogenous aspirations. Int Econ Rev 41(4):921–950
Brown R (2004) Consideration of the origin of Herbert Simon’s theory of “satisficing” (1933–1947). Manag Decis 42(10):1240–1256
Burton RM, Obel B (2011) Computational modeling for what-is, what-might-be, and what-should-be studies and triangulation. Organ Sci 22(5):1195–1202
Caplin A, Dean M, Martin D (2011) Search and satisficing. Am Econ Rev 101(7):2899–2922
Carley K (1992) Organizational learning and personnel turnover. Organ Sci 3(1):20–46
Carley KM, Zhiang L (1997) A theoretical study of organizational performance under information distortion. Manag Sci 43(July):976–997
Chang M-H, Harrington JE (1998) Organizational structure and firm innovation in a retail chain. Comput Math Organ Theory 3(4):267–288
Chang M-H, Harrington JE (2006) Agent-based models of organizations. In: Tesfatsion L, Judd KL (eds) Handbook of computational economics: agent-based computational economics, vol 2. book section 26. Amsterdam, Elsevier, pp 1273–1337
Chen S-H (2012) Varieties of agents in agent-based computational economics: a historical and an interdisciplinary perspective. J Econ Dyn Control 36(1):1–25
Conlisk J (1996) Why bounded rationality? J Econ Lit 34(2):669–700
Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms, 3rd edn. MIT Press, Cambridge
Csaszar FA (2018) A note on how NK landscapes work. J Organ Des 7(1):15
Cyert RM, March JG (1963) A behavioral theory ot the firm. Prentice Hall, Englewood Cliffs
Denrell J, March JG (2001) Adaptation as information restriction: the hot stove effect. Organ Sci 12(5):523–538
Dosi G, Levinthal D, Marengo L (2003) Bridging contested terrain: linking incentive-based and learning perspectives on organizational evolution. Ind Corp Change 12(2):413–436
Ellemers N, Gilder DD, Haslam SA (2004) Motivating individuals and groups at work: a social identity perspective on leadership and group performance. Acad Manag Rev 29(3):459–478
Ethiraj SK, Levinthal D (2004) Modularity and innovation in complex systems. Manag Sci 50(2):159–173
Galbraith JR (1973) Designing complex organizations. Addison-Wesley, Reading
Galbraith JR (1974) Organization design: an information processing view. Interfaces 4(3 (May)):28–36
Ganco M, Hoetker G (2009) NK modeling methodology in the strategy literature: bounded search on a rugged landscape. In: Bergh DD, Ketchen DJ (eds) Research methodology in strategy and management. Emerald, Bingley, pp 237–268
Gao D, Akbaritabar A (2021) Using agent-based modeling in routine dynamics research: a quantitative and content analysis of literature. Rev Manag Sci. https://doi.org/10.1007/s11846-021-00446-z
Gao D, Squazzoni F, Deng X (2018) The role of cognitive artifacts in organizational routine dynamics: an agent-based model. Comput Math Organ Theory 24(4):473–499
Gavetti G, Levinthal D (2000) Looking forward and looking backward: cognitive and experiential search. Adm Sci Q 45:113–137
Gigerenzer G (2002) The adaptive toolbox. In: Gigerenzer G, Selten R (eds) Bounded rationality: the adaptive toolbox. MIT Press, Cambridge, pp 37–50
Gigerenzer G (2004) Striking a blow for sanity in theories of rationality. In: Augier M, March JG (eds) Models of a man: essays in memory of Herbert A. Simon. MIT Press, Cambridge, pp 389–409
Gigerenzer G, Gaissmaier W (2011) Heuristic decision making. Annu Rev Psychol 62(1):451–482
Gigerenzer G, Todd PM (1999) Simple heuristics that make us smart. Oxford University Press, Oxford
Gode DK, Sunder S (1993) Allocative efficiency of markets with zero-intelligence traders: market as a partial substitute for individual rationality. J Polit Econ 101(1):119–137
Greve HR (2003) Organizational learning from performance feedback: a behavioral perspective on innovation and change. Cambridge University Press, New York
Güth W (2007) Satisficing in portfolio selection—theoretical aspects and experimental tests. J Socio-Econ 36(4):505–522
Güth W (2010) Satisficing and (un)bounded rationality: a formal definition and its experimental validity. J Econ Behav Organ 73(3):308–316
Hoffrage U, Reimer T (2004) Models of bounded rationality: the approach of fast and frugal heuristics. Manag Rev 15:437–459
Hommes CH (2006) Heterogeneous agent models in economics and finance. In: Tesfatsion L, Judd KL (eds) Handbook of computational economics: agent-based computational economics, vol 2. book section 23. Elsevier, Amsterdam, pp 1109–1186
Jain A, Kogut B (2014) Memory and organizational evolvability in a neutral landscape. Organ Sci 25(2):479–493
Joseph J, Gaba V (2015) The fog of feedback: ambiguity and firm responses to multiple aspiration levels. Strateg Manag J 36(13):1960–1978
Kauffman SA (1993) The origins of order: self-organization and selection in evolution. Oxford University Press, Oxford
Kauffman SA, Levin S (1987) Towards a general theory of adaptive walks on rugged landscapes. J Theor Biol 128(1 (September)):11–45
Knudsen T, Levinthal DA (2007) Two faces of search: alternative generation and alternative evaluation. Organ Sci 18(1):39–54
Kollman K, Miller JH, Page SE (2000) Decentralization and the search for policy solutions. J Law Econ Organ 16(1):102–128
Lant TK (1992) Aspiration level adaptation: an empirical exploration. Manag Sci 38(5):623–644
Lant T, Shapira Z (2008) Managerial reasoning about aspirations and expectations. J Econ Behav Organ 66(1):60–73
Levinthal DA (1997) Adaptation on rugged landscapes. Manag Sci 43(7):934–950
Levinthal DA (2016) Learning and adaptation. In: Augier M, Teece DJ (eds) The Palgrave encyclopedia of strategic management. Palgrave Macmillan UK, London, pp 1–5
Levinthal DA, March JG (1981) A model of adaptive organizational search. J Econ Behav Organ 2:307–333
Levitan B, Kauffman SA (1995) Adaptive walks with noisy fitness measurements. Mol Divers 1(1 (September)):53–68
Li R, Emmerich MTM, Eggermont J, Bovenkamp EGP, Bäck T, Dijkstra J, Reiber JHC (2006) Mixed-integer nk landscapes. In: Runarsson T, Beyer H-G, Burke E, Merelo-Guervós JJ, Whitley LD, Yao X (eds) Parallel problem solving from nature—PPSN IX, vol 4193. Lecture notes in computer science, book section 5. Springer, Berlin, pp 42–51
Li X, Pu W, Zhao X (2019) Agent action diagram: toward a model for emergency management system. Simul Model Pract Theory 94:66–99
Li X, Zhang W, Zhao X, Pu W, Chen P, Liu F (2021) Wartime industrial logistics information integration: framework and application in optimizing deployment and formation of military logistics platforms. J Ind Inf Integr 22:100201
Lorscheid I, Heine B-O, Meyer M (2012) Opening the “black box” of simulations: increased transparency and effective communication through the systematic design of experiments. Comput Math Organ Theory 18(1):22–62
Macken CA, Hagan PS, Perelson AS (1991) Evolutionary walks on rugged landscapes. SIAM J Appl Math 51(3):799–827
Mercier H (2017) Confirmation bias–myside bias. In: Pohl RF (ed) Cognitive illusions, book section, vol 5, 2nd edn. Routledge, New York, pp 99–114
Mezias SJ (1988) Aspiration level effects: an empirical investigation. J Econ Behav Organ 10(4):389–400
Parker AM, De Bruin WB, Fischhoff B (2007) Maximizers versus satisficers: decision-making styles, competence, and outcomes. Judgm Decis Mak 2(6):342
Posen HE, Keil T, Kim S, Meissner FD (2018) Renewing research on problemistic search—a review and research agenda. Acad Manag Ann 12(1):208–251
Puranam P, Stieglitz N, Osman M, Pillutla MM (2015) Modelling bounded rationality in organizations: progress and prospects. Acad Manag Ann 9(1):337–392
Radner R (1975) Satisficing. In: Optimization techniques IFIP technical conference. Springer, Berlin, pp 252–263
Redman TC (1998) The impact of poor data quality on the typical enterprise. Commun ACM 41(2):79–82
Rivkin JW, Siggelkow N (2003) Balancing search and stability: interdependencies among elements of organizational design. Manag Sci 49(3):290–311
Rivkin JW, Siggelkow N (2007) Patterned interactions in complex systems: implications for exploration. Manag Sci 53(July):1068–1085
Rosenfeld A, Kraus S (2012) Modeling agents based on aspiration adaptation theory. Auton Agents Multi-Agent Syst 24(2):221–254
Russell SJ, Norvig P (2016) Artificial intelligence: a modern approach. Pearson Education Limited, Singapore
Safarzyńska K, van den Bergh J (2010) Evolutionary models in economics: a survey of methods and building blocks. J Evol Econ 20(3):329–373
Schwartz H (2008) The role of aspirations and aspirations adaptation in explaining satisficing and bounded rationality. J Socio-Econ 37(3):949–957
Schwartz B, Ward A, Monterosso J, Lyubomirsky S, White K, Lehman DR (2002) Maximizing versus satisficing: happiness is a matter of choice. J Pers Soc Psychol 83(5):1178–1197
Selman B, Gomes CP (2006) Hill-climbing search. In: Nadel L (ed) Encyclopedia of cognitive science. Wiley. https://doi.org/10.1002/0470018860.s00015
Selten R (1998) Aspiration adaptation theory. J Math Psychol 42(2–3):191–214
Selten R (2002) What is bounded rationality? In: Gigerenzer G, Selten R (eds) Bounded rationality: the adaptive toolbox, book section 2. MIT Press, Cambridge, pp 13–36
Siggelkow N, Levinthal DA (2003) Temporarily divide to conquer: centralized, decentralized, and reintegrated organizational approaches to exploration and adaptation. Organ Sci 14(6):650–669
Siggelkow N, Rivkin JW (2005) Speed and search: designing organizations for turbulence and complexity. Organ Sci 16(2):101–122
Simon HA (1955) A behavioral model of rational choice. Q J Econ 69(September):99–118
Simon HA (1959) Theories of decision-making in economics and behavioral science. Am Econ Rev 49(3):253–283
Simon HA (1962) The architecture of complexity. Proc Am Philos Soc 106(6):467–482
Simon HA (1979) Rational decision making in business organizations. Am Econ Rev 69(4):493–513
Smarzhevskiy IA, Solovev DB (2020) Model of organizational behavior in a hierarchical structure. In: IOP conference series: earth and environmental science, vol 459. IOP Publishing, p 042003
Staw BM (1981) The escalation of commitment to a course of action. Acad Manag Rev 6(4):577–587
Stigler GJ (1961) The economics of information. J Polit Econ 69(3):213–225
Tee SW, Bowen PL, Doyle P, Rohde FH (2007) Factors influencing organizations to improve data quality in their information systems. Account Finance 47(2):335–355
Tesfatsion L (2003) Agent-based computational economics: modeling economies as complex adaptive systems. Inf Sci 149(4):262–268
Thompson JD (1967) Organizations in action. Social science bases of administrative theory, McGraw-Hill, New York
Todd PM, Gigerenzer G (2003) Bounding rationality to the world. J Econ Psychol 24(2):143–165
Tracy WM, Markovitch DG, Peters LS, Phani BV, Philip D (2017) Algorithmic representations of managerial search behavior. Comput Econ 49(3):343–361
von Neumann J, Morgenstern O, Kuhn HW (2007) Theory of games and economic behavior (commemorative edition). Princeton University Press, Princeton
Wall F (2010) The (beneficial) role of informational imperfections in enhancing organisational performance. In: Li Calzi M, Milone L, Pellizzari P (eds) Progress in artificial economics, vol 645. Lecture notes in economics and mathematical systems, book section 10. Springer, Berlin, pp 115–126
Wall F (2016) Agent-based modeling in managerial science: an illustrative survey and study. Rev Manag Sci 10(1):135–193
Wall F (2017) Learning to incentivize in different modes of coordination. Adv Complex Syst 20(2–3):1–29
Wall F (2018) Emergence of task formation in organizations: balancing units’ competence and capacity. J Artif Soc Soc Simul 21(2):1–25
Washburn M, Bromiley P (2012) Comparing aspiration models: the role of selective attention. J Manag Stud 49(5):896–917
Wright S (1932) The roles of mutation, inbreeding, crossbreeding, and selection in evolution. In: 6th international congress of genetics. pp 356–366
Za S, Spagnoletti P, Winter R, Mettler T (2018) Exploring foundations for using simulations in IS research. Commun Assoc Inf Syst 42(Feb.):268–300
Funding
Open access funding provided by University of Klagenfurt.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author declares that she has no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
This appendix provides an overview of how the artificial organizations are mapped in the multi-agent simulation, according to the approach of the agent action diagram (AAD) as introduced by Li et al. (2019). Among the particular features of the AAD approach is the differentiation between agents’ “thinking”, “command”, and “behavior” actions. However, it is worth mentioning that, in the “rudimentary” organizational structure in our model as reflecting the paper’s particular focus, the command and behavior actions are only captured implicitly in the model (for this, see also Sect. 4.1). Figure 7 shows (a) the agent entity organization diagram and (b) the entity action attribute diagram for the managerial decision-makers (for the headquarters, see footnote 6) according to the AAD concept (Li et al. 2019).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wall, F. Modeling managerial search behavior based on Simon’s concept of satisficing. Comput Math Organ Theory 29, 265–299 (2023). https://doi.org/10.1007/s10588-021-09344-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10588-021-09344-x