1 Introduction

The term participatory budgeting (PB) has been used to describe a range of mechanisms that directly involve citizens in public spending decisions [9]. The basic idea is that people can vote on grassroots projects (e.g., building a playground or funding a cultural event), each of which has a certain cost. The most popular projects—that fit a given budget constraint—then get funded. In recent years, PB has flourished around the world, making it one of the most popular tools of participatory democracy. At the same time, it also has received a lot of attention in the literature on (computational) social choice [3, 30].

Given the votes of the citizens, it is not always obvious how to decide which projects to fund (in other words, there are many different voting rules one could consider using). The common approach to choosing a voting rule is the normative approach, where we encode normatively desirable properties of voting rules, e.g., properties related to fairness, in the form of axioms and then analyse to what extent a given voting rule will satisfy those axioms. In this paper, we instead explore what is known as the epistemic approach—or truth-tracking approach—to the analysis and design of voting rules for PB. Both approaches play a central role in the broader field of computational social choice [7]. But in the specific context of PB, the epistemic approach may, at first sight, seem less natural a choice. So let us briefly outline why we nonetheless believe that it is important to explore its potential.

One reason is that, even in the familiar setting of PB with citizens voting for projects they personally would like to receive public funding, we might interpret these votes as evidence for objective quality. Indeed, whether a given project is a success will often become clear only some time after it has been realised: Will local citizens actually use the compost bins? Will the number of mosquitoes go down? Will the new speed camera reduce the number of accidents?Footnote 1 So we might think of the citizens casting their votes as agents with bounded rationality who enjoy a noisy view of this ground truth. They do not know what the best set of projects to fund is, but each of them is more likely to vote for a good rather than a bad set of projects.

Furthermore, PB as a means to select projects to receive public funding is but one example of a selection process of costly alternatives. For some other such processes—that are, at least mathematically, equivalent to PB—the existence of a ground truth may be more obvious. Consider, for instance, the case of the Eterna platform.Footnote 2 On this collaborative platform, users can submit different ways of folding a given protein. A subset of the proposed configurations is then synthesised in a laboratory to determine which are most stable. One can think of this as a PB process: the projects are the different protein foldings; their cost is the cost of synthesising them; the budget limit is the amount of money that is allocated to this process; and finally, the protein foldings submitted by a user constitute their approval ballot. Mathematically speaking, this is thus a well-defined PB process. Moreover, there is a clear ground truth here: a set of objectively most stable protein configurations. Interestingly, this is also the motivating example of the first epistemic analysis of multi-winner voting rules,Footnote 3 though in a setting without costs [27].

A last example is that of a selection committee for research grant proposals. In such a committee, the members have to decide which of the grants should be funded. The grants typically have different costs, and there is a maximum amount of money that can be allocated. Then one might argue that the proposals have a “ground truth” probability of success that can only be observed a posteriori. At selection time, the committee members observe noisy signals regarding this ground truth through the reports of reviewers, and they need to make a decision based on this information.

If we have a clear idea how these noisy views on the ground truth are generated (votes that are cast in the context of PB, protein configurations that are proposed for the EterRNA platform, or approvals submitted for some grant proposals)—that is, if we have a well-defined noise model—we, in principle, are able to design a voting rule that maximises the likelihood of returning the ground truth, i.e., the best set of projects that fit our budget. Of course, in practice we do not have access to this noise model. Still, if a natural voting rule turns out to be such a maximum likelihood estimator (MLE) for a natural choice of noise model, then we can interpret this as an argument for using that rule. Similarly, if we can prove that for a given rule there does not exist any noise model that would make that rule an MLE, then that rule will be less suitable in contexts where it is reasonable to assume that there exists some form of ground truth regarding the right decision to be taken.


Contribution. We first analyse the rule overwhelmingly used in practice around the world—the greedy cost approval rule—as well as the rules most studied in the recent literature—the Method of Equal Shares [8, 25] and the Sequential Phragmén Rule [24]. Using a necessary condition provided by [14], we prove that none of these rules can be interpreted as an MLE, even for instances where all projects have the same cost (corresponding to multi-winner voting instances).

We then turn to a family of rules that all satisfy the aforementioned necessary condition: additive argmax rules. These can be thought of as welfare-maximising rules. We focus on eight specific rules based on either utilitarian or Nash social welfare. In the case of utilitarian social welfare, we show that it is impossible to find a noise model for which the most natural rules would be MLEs in the general case. For Nash welfare, the picture is brighter, since for two rules we are able to identify noise models under which they are MLEs.

Still, the overall picture pained by our analysis is largely negative, demonstrating that it is difficult to design voting rules for PB with good epistemic performance, certainly if this requirement comes on top of other, particularly common normative, requirements. Identifying conditions under which more positive results can be achieved—and ideally conditions that can be satisfied for specific application scenarios where the epistemic perspective is most relevant—therefore presents itself as an important challenge for future work in the field.


Related work. The study of PB rules is part of social choice theory, which more generally deals with the design and analysis of voting rules for different kinds of scenarios. Given that for every application domain many different rules can be—and have been—devised, it can be hard for the decision maker to select the rule to be used. To assist in this choice, two main approaches have been developed. The first one, the normative or axiomatic approach [1, 34], tries to identify voting rules that satisfy certain normative requirements. The second one, the epistemic approach [15, 26], seeks out rules that can recover a ground truth, assuming that the votes received are noisy estimates of that ground truth. It is the latter approach we follow here.

Formal work on PB to date instead has followed the axiomatic approach, with a special focus on fairness [2, 8, 21, 22, 24, 25], incentive compatibility [16, 19, 20, 29], and monotonicity requirements [4, 28, 32]. We refer the reader to the survey by [30] for more details.

The epistemic approach has been first applied to the standard voting model [12, 14, 35]. Later on, other social choice scenarios have been investigated through the epistemic lens, notably multi-winner elections [13, 27], and judgment aggregation [5, 6, 33]. To the best of our knowledge, the only epistemic study of PB is the one section dedicated to the topic by [20], though in the context of divisible PB, i.e., when projects can be partially funded. Interestingly, they show that knapsack voting—a PB rule that resembles the greedy cost approval rule in the divisible setting—can be interpreted as a maximum likelihood estimator, while we will see that this is not the case for the greedy cost approval rule in the context of indivisible PB.

2 Preliminaries

In this section, we recall the standard model of PB, used in much of the literature [see, e.g., 3, 30, 32], and then define what it means for a PB rule to be a maximum likelihood estimator (MLE).

2.1 Participatory budgeting

A PB problem is described by an instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\), where \(\mathcal {P}\) is the set of available projects, \(c: \mathcal {P}\rightarrow \mathbb {N}\) is the cost function—mapping any given project \(p \in \mathcal {P}\) to its cost \(c(p) \in \mathbb {N}\)—and \(b \in \mathbb {N}\) is the budget limit. We write c(P) instead of \(\sum _{p \in P} c(p)\) for sets of projects \(P \subseteq \mathcal {P}\).

For a given PB instance, we ask several agents to each submit an approval ballot \(A \subseteq \mathcal {P}\), resulting in a vector \(A\) of ballots, one for each agent. Such a vector of approval ballots is called a profile. Given two profiles \(A\) and \({A^\prime }\), we use \(A{{\,\mathrm{\oplus }\,}}{A^\prime }\) to denote the profile obtained by concatenating them.

Given an instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\), we need to select a subset of projects \(\pi \subseteq \mathcal {P}\) to implement. Such a budget allocation \(\pi\) has to be feasible, i.e., we require \(c(\pi ) \le b\). Let \(\mathcal {A}(I) = \{\pi \subseteq \mathcal {P}\mid c(\pi ) \le b\}\) be the set of feasible budget allocations for I. Moreover, let \(\mathcal {A}_{ EX }(I)\) be the set of all exhaustive budget allocations for I, i.e., of allocations \(\pi \in \mathcal {A}(I)\) for which there is no project \(p \in \mathcal {P}\setminus \pi\) such that \(c(\pi \cup \{p\}) \le b\).

Computing budget allocations is done by means of PB rules. An irresolute PB rule \(F\) is a function that takes as input a PB instance I and a profile \(A\) over I, and that returns a nonempty set of feasible budget allocations \(F(I, A) \subseteq \mathcal {A}(I)\). A rule is exhaustive if \(F(I, A) \subseteq \mathcal {A}_{ EX }(I)\) for all I and \(A\).

Some of our results will apply only to unit-cost instances. An instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\) is a unit-cost instance if there exists an \(\ell \in \mathbb {N}\) such that (i\(c(p) = \ell\) for all projects \(p \in \mathcal {P}\) and (ii\(b \equiv 0 \bmod \ell\). This restriction is particularly interesting since unit-cost instances are equivalent to multi-winner voting problems where one needs to elect a committee of some fixed size k [18]. Candidates can be thought of as projects of cost \(\ell\), so under a budget limit of \(b=k\cdot \ell\) exhaustive budget allocations correspond to such committees.

2.2 The truth-tracking perspective

Under the truth-tracking perspective, we assume that, for every given instance I, there exists an objectively best feasible budget allocation, the so-called ground truth \(\pi ^\star\), and we would like every reasonable rule to select \(\pi ^\star\). The ground truth is not known, neither to the agents nor to the decision maker. We will thus assess the quality of PB rules based on their ability to retrieve the ground truth given noisy votes.

Formally, a noise model \(\mathcal {M}\) is a generative model that produces random approval ballots for a given instance and ground truth. We represent it as a probability distribution over all approval ballots. For a given instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\), ground truth \(\pi ^\star \in \mathcal {A}(I)\), and approval ballot \(A \subseteq \mathcal {P}\), we denote by \(\mathbb {P}_\mathcal {M}(A \mid \pi ^\star , I)\) the probability for the noise model \(\mathcal {M}\) to generate ballot A given I and \(\pi ^\star\). For profiles generated by the noise model, ballots are drawn identically and independently from \(\mathcal {M}\).

Suppose the noise model \(\mathcal {M}\) indicates how the voters form their preferences for any given ground truth.Footnote 4 Then a good rule should select the outcome that most likely would have generated the observed profile we observe if it were the ground truth plugged into \(\mathcal {M}\). This is the maximum likelihood estimator (MLE) for \(\mathcal {M}\).

Definition 1

(Maximum likelihood estimators) For a noise model \(\mathcal {M}\), the likelihood of a profile \(A\) over the instance I and a budget allocation \(\pi \in \mathcal {A}(I)\) is defined as:

$$\begin{aligned} L_{\mathcal {M}}(A, \pi , I) = \prod _{A \in A} \mathbb {P}_\mathcal {M}(A \mid \pi , I). \end{aligned}$$

A PB rule \(F\) is said to be the MLE for \(\mathcal {M}\), if for every instance I and every profile \(A\) we have:

$$\begin{aligned} F(I, A) = \mathop {\textrm{argmax}}\limits _{\pi ^\star \in \mathcal {A}(I)} L_{\mathcal {M}}(A, \pi ^\star , I). \end{aligned}$$

In the context of the standard model of voting theory, [14] identified a necessary condition for a voting rule to be interpretable as an MLE: it should satisfy what we are going to call weak reinforcement. This result straightforwardly carries over to the PB setting.

Definition 2

(Weak reinforcement) A PB rule \(F\) is said to be satisfying weak reinforcement if and only if, for every instance I and every two profiles \(A\) and \({A^\prime }\), we have:

$$\begin{aligned} F(I, A) = F(I, {A^\prime }) \;\Longrightarrow \; F(I, A{{\,\mathrm{\oplus }\,}}{A^\prime }) = F(I, \varvec{A}). \end{aligned}$$

Lemma 1

([14, 14]) If a PB rule \(F\) does not satisfy weak reinforcement, then there exists no noise model \(\mathcal {M}\) for which \(F\) is the MLE.

Note that this result applies for any set of possible ground truths, so also if we assume the ground truth to be exhaustive.

As a first application of this lemma, let us consider what perhaps is the most widely used rule in real-world PB elections, namely the greedy cost approval rule \(F_{ greed }\) [3, 20]. Under this rule, we select projects in order of the number of approvals they receive, skipping over any projects that would lead us to exceed the budget.Footnote 5 Note that \(F_{ greed }\) is exhaustive. Unfortunately, we can easily show that it cannot be interpreted as an MLE.

Proposition 2

There exists no noise model \(\mathcal {M}\) such that the greedy cost approval rule \(F_{ greed }\) is the MLE for \(\mathcal {M}\).

Proof

Consider an instance I with three projects denoted by \(p_1\), \(p_2\) and \(p_3\), a budget limit of \(b = 4\), and costs as shown in the table below. Moreover, consider two profiles \(A\) and \({A^\prime }\) in which the approval scores are as presented below.

 

Cost

Approval score

in \(A\)

Approval score

in \({A^\prime }\)

Approval score

in \(A{\mkern 1mu} \oplus {\mkern 1mu} {A^\prime }{\rm{ }}\)

\(p_1\)

2

10

1

11

\(p_2\)

2

1

10

11

\(p_3\)

3

9

9

18

One can easily check that \(F_{ greed }\) would return \(\{\{p_1, p_2\}\}\) on both \(\varvec{A}\) and \(\varvec{A}'\). However, on the joint profile \(\varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}'\), it would return \(\{\{p_3\}\}\). Weak reinforcement is thus violated. The claim now follows from Lemma 1. \(\square\)

3 Proportional PB rules

A large part of recent research on PB has been devoted to the study of proportional rules, i.e., rules that treat groups of agents fairly. In this section, we focus on the most prominent ones—Sequential Phragmén [8, 24] and approval-based variants of the Method of Equal Shares (MES) [25]—and show that they also cannot be interpreted as MLEs.

Definition 3

(Sequential Phragmén) Given an instance I and a profile \(\varvec{A}\), the Sequential Phragmén rule constructs budget allocations \(\pi\) using the following continuous process.

Initially, \(\pi\) is the empty set. Voters receive money in a virtual currency. They all start with a budget of 0 and that budget continuously increases as time passes. At time t a voter will have received an amount t of money. For any time t, let \(P^\star _t\) be the set of projects \(p \in \mathcal {P}\) for which the approvers together have at least c(p) money available. As soon as, for a given t, the set \(P^\star _t\) is non-empty, if there exists a \(p \in P^\star _t\) such that \(c(\pi \cup \{p\}) > b\), the process stops; otherwise one project from \(P^\star _t\) is selected, the budget of its approvers is set to 0, and the process resumes.

The outcome of Sequential Phragmén is the set of all budget allocations constructed by the procedure above (for all possible ways of breaking ties between the projects in \(P^\star _t\)).

Observe that, due to the fact that each voter’s budget increases continuously, the approvers whose project p is selected at a given time t in fact will have accumulated exactly c(p), not more, which justifies setting their budgets back all the way to 0. Also note that with the above stopping condition—required to guarantee a property known as priceability [24]—Sequential Phragmén is not exhaustive. But it is exhaustive on unit-cost instances.

Unfortunately, whatever assumptions we are willing to make regarding the noise model generating the approval sets of voters, Sequential Phragmén cannot be used to track the ground truth, not even for unit-cost instances.

Proposition 3

There exists no noise model \(\mathcal {M}\) such that Sequential Phragmén is the MLE for \(\mathcal {M}\), not even on unit-cost instances with the additional assumption that the ground truth is exhaustive.

Proof

Consider an instance I with four projects denoted by \(p_1\), \(p_2\), \(p_3\), and \(p_4\), all of cost 1, and budget limit of \(b = 3\). We consider two profile, \(\varvec{A}^1\) and \(\varvec{A}^2\), as presented below, where \(\varvec{A}^1\) is on the left and \(\varvec{A}^2\) is on the right.

 

\(p_1\)

\(p_2\)

\(p_3\)

\(p_4\)

 

\(p_1\)

\(p_2\)

\(p_3\)

\(p_4\)

Cost

1

1

1

1

Cost

1

1

1

1

\(A_1^1\)

\(A_1^2\)

\(A_2^1\)

\(A_2^2\)

\(A_3^1\)

\(A_3^2\)

\(A_4^1\)

\(A_4^2\)

\(A_5^1\)

\(A_5^2\)

\(b = 3\)

\(b = 3\)

We claim that on both profile \(\varvec{A}^1\) and profile \(\varvec{A}^2\), Sequential Phragmén outputs a single budget allocation, namely \(\pi = \{p_1, p_3, p_4\}\). Remember that, in the unit-cost setting, Sequential Phragmén is exhaustive. The budget allocation \(\pi\) indeed is exhaustive.

For \(\varvec{A}^1\), after \(\nicefrac {1}{2}\) units of money have been distributed, both \(p_3\) and \(p_4\) will have been bought at a price of \(\nicefrac {1}{4}\) each. Once an additional \(\nicefrac {1}{4}\) units of money have been injected, project \(p_1\) is bought as well, making \(\pi\) the unique budget allocation returned by Sequential Phragmén.

For \(\varvec{A}^2\), first \(p_1\) is bought at a price of \(\nicefrac {1}{4}\), then \(p_3\) at a price of \(\nicefrac {1}{3}\). Finally, after \(\nicefrac {1}{3}\) additional units of money have been distributed, project \(p_4\) is bought (at that time, the supporters of project \(p_2\) have collected \(\nicefrac {1}{4} + \nicefrac {2}{3} < 1\) money). The final outcome is thus indeed \(\{\pi \}\).

Now, consider the joint profile \(\varvec{A}^3 = \varvec{A}^1 {{\,\mathrm{\oplus }\,}}\varvec{A}^2\), and let us detail the run of Sequential Phragmén on I and \(\varvec{A}^3\). The first project to be bought is \(p_3\) at price \(\nicefrac {1}{7}\). Then, once an extra \(\nicefrac {5}{42}\) units of money have been distributed, project \(p_1\) is bought as well. At that time, the supporters of \(p_4\) who do not approve of \(p_2\) have no money since they approve of \(p_1\). On the other hand, the only supporter of \(p_2\) who does not approve of \(p_4\) has a strictly positive amount of money. Project \(p_2\) will then be the last project selected (after another \(\nicefrac {2}{21}\) units of money have been injected). Overall, the outcome will be \(\{\{p_1, p_2, p_3\}\} \ne \{\pi \}\).

We have thus proven that the Sequential Phragmén rule fails weak reinforcement. Lemma 1 then concludes the proof. Note that all budget allocations we considered are exhaustive; the result thus also applies if we only focus on exhaustive ground truths. \(\square\)

The same bad news apply to the MES-based rules. These rules are parametrised by a measure of the satisfaction of the voters. We call satisfaction function (on singletons), any mapping from projects p to satisfaction levels \(\mu (p) \in \mathbb {R}_{>0}\).Footnote 6 We can now define MES with respect to satisfaction function \(\mu\).

Definition 4

(\(\hbox {MES}_{\mu }\)) Given an instance I, a profile \(\varvec{A}= (A_1,\ldots ,A_n)\) with n agents, and a satisfaction function \(\mu\), \(\hbox {MES}_{\mu }\) constructs budget allocations \(\pi\), initially empty, iteratively as follows. Every agent i is initially assigned a budget \(b_i = \nicefrac {b}{n}\) of virtual money. Given a budget allocation \(\pi\), a project \(p \in \mathcal {P}\setminus \pi\) is said to be \(\alpha\)-affordable, for \(\alpha \in \mathbb {R}_{\ge 0}\) if:

$$\begin{aligned} \sum _{i \,\mid \, p \in A_i} \min (b_i, \alpha \cdot \mu (p)) \hspace{5.0pt}\ge \hspace{5.0pt}c(p). \end{aligned}$$

At a given round with current budget allocation \(\pi\), if no project is \(\alpha\)-affordable for any \(\alpha\), \(\hbox {MES}_{\mu }\) terminates. Otherwise, let \(P^\star\) be the set of projects that are \(\alpha ^*\)-affordable for a minimum \(\alpha ^*\). The rule selects one project \(p \in P^\star\) (\(\pi\) is updated to \(\pi \cup \{p\}\)), and every approver i of p sees their budget reduced by \(\min (b_i, \alpha \cdot \mu (p))\).

The outcome of \(\hbox {MES}_{\mu }\) is the set of all budget allocations constructed by the procedure above (for all possible ways of breaking ties between the projects in \(P^\star\)).

Note that \(\hbox {MES}_\mu\) fails to be an exhaustive rule, for any satisfaction function \(\mu\) and even on unit-cost instances.

We show that for no \(\mu\) can \(\hbox {MES}_{\mu }\) be interpreted as an MLE.

Proposition 4

For any given satisfaction function \(\mu\), there is no noise model \(\mathcal {M}\) such that \(\hbox {MES}_{\mu }\) is the MLE for \(\mathcal {M}\), not even on unit-cost instances.

Proof

Consider an instance I with two projects denoted by \(p_1\) and \(p_2\), both of cost 1, and a budget limit \(b = 2\). Let \(\mu\) be an arbitrary satisfaction function.

Consider the two profiles \(\varvec{A}^1 = (\{p_1\}, \{p_2\})\) and \(\varvec{A}^2 = (\{p_1, p_2\}, \{p_1, p_2\})\). We claim that on both of these profiles, \(\hbox {MES}_{\mu }\) would return \(\pi = \{\{p_1, p_2\}\}\). Indeed, on \(\varvec{A}^1\) both agents receive 1 unit of money and can both afford the project they approve of. On \(\varvec{A}^2\) both agents approve of all the projects and can afford them. Now, for \(\varvec{A}^3 = (\{p_1\}, \{p_2\}, \{p_1, p_2\}, \{p_1, p_2\}) = \varvec{A}^1 {{\,\mathrm{\oplus }\,}}\varvec{A}^2\), we claim that \(\hbox {MES}_{\mu }\) would return either \(\{\{p_1\}\}\), \(\{\{p_2\}\}\), or \(\{\{p_1\}, \{p_2\}\}\). Here the initial budget is \(\nicefrac {1}{2}\) for each agent. Thus, the approvers of \(p_1\) collectively have \(\nicefrac {3}{2}\) units of money, and the same is true for \(p_2\). Since \(\mu (p) > 0\) for both \(p_1\) and \(p_2\) (by definition of satisfaction functions) and since we can choose \(\alpha\) arbitrarily large, this implies that \(\hbox {MES}_{\mu }\) would select either \(p_1\) or \(p_2\) in the first round. Let \(p^\star\) be the selected project and p the other project. To buy \(p^\star\), all its approvers need to pay \(\nicefrac {1}{3}\). The approvers of p are thus now left with \(\nicefrac {1}{2} + 2 \cdot (\nicefrac {1}{2} - \nicefrac {1}{3}) = \nicefrac {1}{2} + \nicefrac {1}{3} < 1\), which is not enough to afford p. There is thus no way for \(\hbox {MES}_{\mu }\) to return \(\{\{p_1, p_2\}\}\) on \(\varvec{A}^3\).

We have thus shown that \(\hbox {MES}_{\mu }\) fails weak reinforcement. Lemma 1 then concludes the proof. \(\square\)

Note that for the proofs of both the results in this section, the outcomes on the initial profiles are always disjoint from the outcomes on the joint profiles. This implies that even resolute versions of the rules (obtained by introducing some form of tie-breaking), or any refinement, would also fail weak reinforcement.

4 Monotonic argmax rules

As we have seen, our first obstacle to finding rules that are MLEs is that none of the proportional rules we considered satisfy weak reinforcement. Aiming for more positive results, we will now follow a different approach: instead of checking whether known PB rules do satisfy weak reinforcement, we will start from rules we know satisfy it, and investigate their epistemic abilities. To this end, we will thus focus on monotonic argmax rules, a large class of rules, all of which satisfy weak reinforcement. We start by defining both what an argmax rule is and what makes such a rule monotonic.

Definition 5

(Monotonic Argmax Rules) A PB rule \(F\) is called an argmax rule if there exists a function f, taking as input an instance I, a profile \(\varvec{A}\), and a budget allocation \(\pi\), and returning a number \(f(I, \varvec{A}, \pi ) \in \mathbb {R}\), such that for all instances I and all profiles \(\varvec{A}\), we have:

$$\begin{aligned} F(I, \varvec{A}) = \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} f(I, \varvec{A}, \pi ). \end{aligned}$$

An argmax rule defined via the function f is called monotonic if for every two profiles \(\varvec{A}\) and \(\varvec{A}'\) and every two budget allocations \(\pi\) and \(\pi '\), the following two conditions hold:

  1. 1.

    \(\displaystyle \left. \begin{array}{r} f(I, \varvec{A}, \pi )< f(I, \varvec{A}, \pi ')\\ f(I, \varvec{A}', \pi )< f(I, \varvec{A}', \pi ') \end{array} \right\} \Longrightarrow f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ) < f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi '),\)

  2. 2.

    \(\displaystyle \left. \begin{array}{r} f(I, \varvec{A}, \pi ) = f(I, \varvec{A}, \pi ')\\ f(I, \varvec{A}', \pi ) = f(I, \varvec{A}', \pi ') \end{array} \right\} \Longrightarrow f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ) = f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ').\)

Note that every rule is an argmax rule—f can simply be the indicator function on the outcome of the rule for a given instance and profile—but not all rules are monotonic.

Are the monotonic argmax rules good candidates for being MLEs? Yes, they are, as we can show that monotonic argmax rules all satisfy weak reinforcement.

Proposition 5

Every monotonic argmax rule satisfies weak reinforcement.

Proof

Consider the monotonic argmax rule \(F\) defined by the function f. Let I be an instance, and \(\varvec{A}\) and \(\varvec{A}'\) two profiles over I such that \(F(I, \varvec{A}) = F(I, \varvec{A}')\). We show that we also have \(F(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}') = F(I, \varvec{A})\).

Consider a budget allocation \(\pi \in F(I, \varvec{A})\). Since \(F\) is an argmax rule, for all \(\pi ' \in \mathcal {A}(I) \setminus F(I, \varvec{A})\), we know that \(f(I, \varvec{A}, \pi ) > f(I, \varvec{A}, \pi ')\). This also holds for \(\varvec{A}'\). By the definition of a monotonic argmax rule, we immediately get that \(f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ) > f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ')\).

Moreover, for any two budget allocations \(\pi , \pi ' \in F(I, \varvec{A})\), we have \(f(I, \varvec{A}, \pi ) = f(I, \varvec{A}, \pi ')\). The same also holds for \(\varvec{A}'\). Since \(F\) is monotonic, we thus immediately get that \(f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ) = f(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}', \pi ')\).

Overall, we proved that (i) no budget allocation that was winning under \(\varvec{A}\) or \(\varvec{A}'\) is dominated under \(\varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}'\), and (ii) that all budget allocations that were winning under \(\varvec{A}\) and \(\varvec{A}'\) all have the same score. It is then immediate that \(F(I, \varvec{A}{{\,\mathrm{\oplus }\,}}\varvec{A}') = F(I, \varvec{A})\). \(\square\)

In what follows we will introduce and study several concrete examples of monotonic argmax rules, based either on the Nash or on the utilitarian social welfare.

4.1 Nash social welfare

We first study rules based on the Nash social welfare. It tries to reach balanced outcomes by measuring the score of a budget allocation as the product of the agents’ levels of satisfaction. The concept of Nash social welfare provides strong guarantees in the context of the fair allocation of indivisible items [11]. It has also been identified as an appealing rule for PB [31].

4.1.1 Cardinality and cost satisfaction

We first consider two common measures of satisfaction: based on the cardinality and on the cost of approved and selected projects [30, 32]. This gives rise to two argmax rules, \(F^N_{ app }\) and \(F^N_{ cost }\), defined via the following functions:

$$\begin{aligned} f^N_{ app }(I, \varvec{A}, \pi )&= {\left\{ \begin{array}{ll} \sum _{A \in \varvec{A}} \log (|A \cap \pi |) & \textit{if for all } A' \in \varvec{A}, |A' \cap \pi | \ne 0, \\ 0 & \textit{otherwise.} \end{array}\right. } \\ f^N_{ cost }(I, \varvec{A}, \pi )&= {\left\{ \begin{array}{ll} \sum _{A \in \varvec{A}} \log (c(A \cap \pi )) & \textit{if for all } A' \in \varvec{A}, c(A' \cap \pi ) \ne 0, \\ 0 & \textit{otherwise.} \end{array}\right. }. \end{aligned}$$

It can be checked that these two argmax rules are indeed monotonic. From Proposition 5, we thus know that they satisfy weak reinforcement. Note that one would need to take the exponential of these function to make the Nash social welfare appear.

Interestingly, under the common and natural assumption that all projects are approved by at least one agent, these two rules are exhaustive.

We start our investigation by introducing a noise model, denoted by \(\mathcal {M}_{ Ncost }\), for which for all I, A and \(\pi ^\star\), we have:

$$\begin{aligned} \mathbb {P}_{\mathcal {M}_{ Ncost }} (A \mid \pi ^\star , I) = \frac{1}{Z_{\pi ^\star }^{ Ncost }} c(A \cap \pi ^\star ), \end{aligned}$$

where, \(Z_{\pi ^\star }^{ Ncost }\) is a suitable normalisation factor ensuring that \(\mathcal {M}_{ Ncost }\) is a well-defined probability distribution.

Under this noise model, the probability of generating a given ballot A increases with the cost of the ground-truth projects in A. The intuition here is that voters may reflect more carefully on expensive projects and thus are more likely to make correct choices for them. Moreover, the probability of generating A increases linearly in the “quality” of A. How realistic this is, of course, is open to debate. On the one hand, this avoids having to assume extremely high probabilities for correctly identifying particularly expensive projects (in the case where the relationship would not be linear). On the other hand, the probability of generating a ballot that is completely wrong (a ballot not including even a single ground-truth project) is zero.

Under \(\mathcal {M}_{ Ncost }\), maximising the likelihood would be similar to maximising the cost-approval Nash social welfare of a budget allocation. However, for this intuitive connection to hold, it should be the case that the normalisation factor \(Z_{\pi ^\star }^{ Ncost }\) is independent of \(\pi ^\star\). So let us look at it in more detail.

Lemma 6

For the noise model \(\mathcal {M}_{ Ncost }\) to be a well-defined probability distribution, it must be the case that:

$$\begin{aligned} Z_{\pi ^\star }^{ Ncost } = 2^{|\mathcal {P}| - 1} c(\pi ^\star ). \end{aligned}$$

Proof

Consider an instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\), an approval ballot \(A \subseteq \mathcal {P}\), and a ground truth \(\pi ^\star \in \mathcal {A}(I)\). For \(\mathcal {M}_{ Ncost }\) to be a probability distribution, it should be the case that:

$$\begin{aligned} \sum _{A \subseteq \mathcal {P}} \mathbb {P}_{\mathcal {M}_{ Ncost }} (A \mid \pi ^\star , I) = 1 \quad \Longleftrightarrow \quad Z_{\pi ^\star }^{ Ncost } = \sum _{A \subseteq \mathcal {P}} c(|A \cap \pi ^\star |). \end{aligned}$$

Remember that there are \(2^{|\mathcal {P}|}\) subsets of projects and that any project \(p \in \mathcal {P}\) appears in exactly half of them. Each time a project \(p \in \pi ^\star\) appears in a subset \(A \subseteq \mathcal {P}\), its contribution to the value of \(Z_{\pi ^\star }^{ Ncost }\) is exactly c(p). We thus have:

$$\begin{aligned} Z_{\pi ^\star }^{ Ncost } = \sum _{A \subseteq \mathcal {P}} c(|A \cap \pi ^\star |) = \sum _{p \in \pi ^\star } 2^{|\mathcal {P}| - 1} c(p) = 2^{|\mathcal {P}| - 1} c(\pi ^\star ). \end{aligned}$$

\(\square\)

This result tells us that the normalisation factor of the noise model \(\mathcal {M}_{ Ncost }\) depends on the ground truth, meaning that the value of the likelihood is impacted by the ground truth one is considering when computing the MLE. In particular, we cannot conclude that the Nash cost-approval maximising rule is the MLE for this noise model since not all feasible budget allocations have the same cost.

Are there specific cases for which the normalisation factor is independent of the ground truth? Yes, namely for unit-cost instances, as then all exhaustive allocations have the same cost.

Proposition 7

Under the assumption that the ground truth is exhaustive, both \(F^N_{ app }\) and \(F^N_{ cost }\) are the MLE of the noise model \(\mathcal {M}_{ Ncost }\) for unit-cost instances.

Proof

Let I be a unit-cost instance I. Consider any two exhaustive budget allocations \(\pi , \pi ' \in \mathcal {A}_{ EX }(I)\). Since we have \(|\pi | = |\pi '| = c(\pi ) = c(\pi ')\), Lemma 6 entails that \(Z_{\pi }^{ Ncost } = Z_{\pi '}^{ Ncost }\). For any profile \(\varvec{A}\), we have then:

$$\begin{aligned} \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} L_{\mathcal {M}_{ Ncost }}(\varvec{A}, \pi , I)&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} \prod _{A \in \varvec{A}} \frac{c(A \cap \pi )}{Z_{\pi }^{ Ncost }} \\&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} \prod _{A \in \varvec{A}} c(A \cap \pi ) \\&= F^N_{ cost }(I, \varvec{A}). \end{aligned}$$

The last line follows form the fact that \(F^N_{ cost }\) is exhaustive.

Given that \(F^N_{ app }\) and \(F^N_{ cost }\) coincide on unit-cost instances, this also applies to \(F^N_{ app }\). \(\square\)

The fact that \(F^N_{ app }\) and \(F^N_{ cost }\) are MLEs for \(\mathcal {M}_{ Ncost }\) only under some restricted hypothesis is the first hint of a general impossibility result. Indeed, we can show that there are no noise models for which these rule are MLEs.

Theorem 8

There is no noise model \(\mathcal {M}\) such that either \(F^N_{ app }\) or \(F^N_{ cost }\) is the MLE of \(\mathcal {M}\), not even on unit-cost instances.

Proof

Consider an instance I with two projects \(p_1\) and \(p_2\) of cost 1, and a budget limit of \(b = 2\). Let \(\mathcal {M}\) be a generic noise model, and denote by \(\mathbb {P}_A^\pi\) the value of \(\mathbb {P}_\mathcal {M}(A \mid \pi , I)\) for any A and \(\pi\). To simplify notation, we omit braces around sets.

For the noise model \(\mathcal {M}\) to be a well-defined probability distribution, the following two equalities should be satisfied:

$$\begin{aligned} \sum _{A \subseteq \mathcal {P}} \mathbb {P}_{A}^{p_1} = 1 & \hspace{5.0pt}\Leftrightarrow \hspace{5.0pt} & \mathbb {P}_{\emptyset }^{~p_1} & + & \mathbb {P}_{p_1}^{~p_1} & + & \mathbb {P}_{p_2}^{~p_1} & + & \mathbb {P}_{p_1, p_2}^{~p_1} & = & 1, \end{aligned}$$
(1)
$$\begin{aligned} \sum _{A \subseteq \mathcal {P}} \mathbb {P}_{A}^{p_1, p_2} = 1 & \hspace{5.0pt}\Leftrightarrow \hspace{5.0pt} & \mathbb {P}_{\emptyset }^{~p_1, p_2} & + & \mathbb {P}_{p_1}^{~p_1, p_2} & + & \mathbb {P}_{p_2}^{~p_1, p_2} & + & \mathbb {P}_{p_1, p_2}^{~p_1, p_2} & = & 1. \end{aligned}$$
(2)

Now, on the single-agent profile \(\varvec{A}= (\emptyset )\), \(F^N_{ app }\) returns \(\mathcal {A}(I)\). So for \(F^N_{ app }\) to be the MLE of \(\mathcal {M}\), we must have \(\mathbb {P}_{\emptyset }^{~p_1} = \mathbb {P}_{\emptyset }^{~p_1, p_2}\). Moreover, on \(\varvec{A}= (\{p_1\})\), we have \(F^N_{ app }(I, \varvec{A}) = \{\{p_1\}, \{p_1, p_2\}\}\), so \(\mathbb {P}_{p_1}^{~p_1} = \mathbb {P}_{p_1}^{~p_1, p_2}\). Using these two equalities and by subtracting (2) from (1), we get:

$$\begin{aligned} (\mathbb {P}_{p_2}^{~p_1} - \mathbb {P}_{p_2}^{~p_1, p_2}) \quad + \quad (\mathbb {P}_{p_1, p_2}^{~p_1} - \mathbb {P}_{p_1, p_2}^{~p_1, p_2}) \quad = \quad 0. \end{aligned}$$
(3)

Now, since \(F^N_{ app }(I, (\{p_2\})) = \{\{p_2\}, \{p_1, p_2\}\}\), we must have \(\mathbb {P}_{p_2}^{~p_1, p_2} > \mathbb {P}_{p_2}^{~p_1}\). For \(\varvec{A}= (\{p_1, p_2\})\), we have \(F^N_{ app }(I, \varvec{A}) = \{\{p_1, p_2\}\}\). We can then derive \(\mathbb {P}_{p_1, p_2}^{~p_1, p_2} > \mathbb {P}_{p_1, p_2}^{~p_1}\). These two last inequalities contradict (3). It is then impossible for \(F^N_{ app }\) to be the MLE of \(\mathcal {M}\) on I. From the unit-cost assumption, it is clear that this also applies to \(F^N_{ cost }\). \(\square\)

This impossibility result concludes this part of our analysis. We will now consider “normalised” satisfaction functions.

4.1.2 Normalised satisfaction

In the hope of overcoming Theorem 8, we also consider normalised variants of the cardinality and cost satisfaction functions. In these variants, the satisfaction of an agent is expressed in terms of the proportion of the outcome that satisfies them. The rules induced by these normalised satisfaction functions are denoted by \(\tilde{F}^N_{ app }\) and \(\tilde{F}^N_{ cost }\), and they are the argmax rules defined in terms of the following functions:

$$\begin{aligned} \tilde{f}^N_{ app }(I, \varvec{A}, \pi )&= {\left\{ \begin{array}{ll} \sum _{A \in \varvec{A}} \log (|A \cap \pi |) - \log (|\pi |) & \textit{if for all } A' \in \varvec{A}, |A' \cap \pi | \ne 0, \\ 0 & \textit{otherwise.} \end{array}\right. } \\ \tilde{f}^N_{ cost }(I, \varvec{A}, \pi )&= {\left\{ \begin{array}{ll} \sum _{A \in \varvec{A}} \log (c(A \cap \pi )) - \log (c(\pi )) & \textit{if for all } A' \in \varvec{A}, c(A' \cap \pi ) \ne 0, \\ 0 & \textit{otherwise.} \end{array}\right. }. \end{aligned}$$

These rules are inspired by the concept of relative satisfaction introduced by [22]. However, while in the cited work the authors normalised the satisfaction with respect to the ballot, we define it here with respect to the budget allocation.

It is worth noting that the rules \(\tilde{F}^N_{ app }\) and \(\tilde{F}^N_{ cost }\) can lead to extreme behaviours. For example, consider an instance with budget limit b, that we assume to be even, and a set of projects \(\mathcal {P}= \{p^\star \} \cup \{p_1, \ldots , p_b\}\) of arbitrary cost lower than b. Consider the two-agent profile \(\varvec{A}\) such that:

$$\begin{aligned} A_1&= \{p^\star \} \cup \{p_1, p_3, \ldots , p_{b - 1}\}&A_2&= \{p^\star \} \cup \{p_2, p_4, \ldots , p_b\}. \end{aligned}$$

Then, according to both rules, selecting just \(p^\star\) is better than anything else. Even if this can seem extreme, these rules can still be justified when considering voters who would rather save public money than use it on projects they do not approve. This would correspond to associating a strong rejection, rather than indifference, with the action of not approving a project.

Note that this example also implies that the rules are not exhaustive, even on unit-cost instances.

Let us first investigate the rule \(\tilde{F}^N_{ cost }\). We will continue using the noise model \(\mathcal {M}_{ Ncost }\) introduced earlier.

Recall the expression we found for the normalisation factor \(Z_{\pi ^\star }^{ Ncost }\) in Lemma 6. Plugging it into the definition of \(\mathcal {M}_{ Ncost }\), we obtain the following expression for any instance I, approval ballot A, and ground truth \(\pi ^\star\):

$$\begin{aligned} \mathbb {P}_{\mathcal {M}_{ Ncost }} (A \mid \pi ^\star , I) = \frac{1}{2^{|\mathcal {P}| - 1}} \frac{c(A \cap \pi ^\star )}{c(\pi ^\star )}. \end{aligned}$$

From this, it should be clear that \(\tilde{F}^N_{ cost }\) is the MLE of \(\mathcal {M}_{ Ncost }\).

Theorem 9

The rule \(\tilde{F}^N_{ cost }\) is the MLE of the noise model \(\mathcal {M}_{ Ncost }\).

Proof

Let \(I = \left\langle \mathcal {P}, c, b \right\rangle\) be an instance. The likelihood of a profile \(\varvec{A}\) and a budget allocation \(\pi \in \mathcal {A}(I)\) under the noise model \(\mathcal {M}_{ Ncost }\) is:

$$\begin{aligned} L_{\mathcal {M}_{ Ncost }}(\varvec{A}, \pi , I) = \prod _{A \in \varvec{A}} \frac{1}{2^{|\mathcal {P}| - 1}} \frac{c(A \cap \pi )}{c(\pi )} = \left( \frac{1}{2^{|\mathcal {P}| - 1}}\right) ^{|\varvec{A}|}\prod _{A \in \varvec{A}} \frac{c(A \cap \pi )}{c(\pi )}. \end{aligned}$$

Since the first multiplicative factor in the above expression is constant over all budget allocations, we have:

$$\begin{aligned} \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} L_{\mathcal {M}_{ Ncost }}(\varvec{A}, \pi , I) = \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} \prod _{A \in \varvec{A}}\frac{c(A \cap \pi )}{c(\pi )} = \tilde{F}^N_{ cost }(I, \varvec{A}). \end{aligned}$$

Thus, maximising the likelihood under \(\mathcal {M}_{ Ncost }\) is the same as maximising the social welfare in the sense of \(\tilde{F}^N_{ cost }\). \(\square\)

We have finally been able to find a PB rule that can be interpreted as an MLE. In the following we will show a similar result for \(\tilde{F}^N_{ app }\). For this rule we introduce a new noise model denoted by \(\mathcal {M}_{ Napp }\). It is such that for any instance I, approval ballot A, and ground truth \(\pi ^\star\), we have:

$$\begin{aligned} \mathbb {P}_{\mathcal {M}_{ Napp }} (A \mid \pi ^\star , I) = \frac{1}{Z_{\pi ^\star }^{ Napp }} |A \cap \pi ^\star |, \end{aligned}$$

where \(Z_{\pi ^\star }^{ Napp }\) is a suitable normalisation factor.

Using a similar proof technique as above, we show that \(\tilde{F}^N_{ app }\) is the MLE of \(\mathcal {M}_{ Napp }\).

Theorem 10

The rule \(\tilde{F}^N_{ app }\) is the MLE of the noise model \(\mathcal {M}_{ Napp }\).

Proof

Let us first compute the exact value of the normalisation factor \(Z_{\pi ^\star }^{ Napp }\). For the noise model \(\mathcal {M}_{ Napp }\) to be a well-defined probability distribution, the following must hold:

$$\begin{aligned} \sum _{A \subseteq \mathcal {P}} \mathbb {P}_{\mathcal {M}_{ Napp }} (A \mid \pi ^\star , I) = 1 \hspace{5.0pt}\Leftrightarrow \hspace{5.0pt}Z_{\pi ^\star }^{ Napp } = \sum _{A \subseteq \mathcal {P}} |A \cap \pi ^\star | \hspace{5.0pt}\Leftrightarrow \hspace{5.0pt}Z_{\pi ^\star }^{ Napp } = 2^{|\mathcal {P}| - 1} |\pi ^\star |. \end{aligned}$$

Hence, given a profile \(\varvec{A}\), we have:

$$\begin{aligned} \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} L_{\mathcal {M}_{ Napp }}(\varvec{A}, \pi )&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} \prod _{A \in \varvec{A}} \frac{1}{2^{|\mathcal {P}| - 1}} \frac{|A \cap \pi |}{|\pi |} \\&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}(I)} \prod _{A \in \varvec{A}} \frac{|A \cap \pi |}{|\pi |} \\&= \tilde{F}^N_{ app }(I, \varvec{A}). \end{aligned}$$

This immediately implies that \(\tilde{F}^N_{ app }\) is the MLE of \(\mathcal {M}_{ Napp }\). \(\square\)

This concludes our study of the Nash social welfare. We now turn to the more standard measure of the social welfare, the utilitarian social welfare.

4.2 Utilitarian social welfare

Let us now turn to the analysis of monotonic argmax rules defined in terms of utilitarian social welfare. As we did before, we will consider different satisfaction functions.

4.2.1 Cardinality and cost satisfaction

Following [32], we define the approval maximising rule \(F_{ app }\) and the cost-approval maximising rule \(F_{ cost }\) as the argmax rules determined by \(f_{ app }\) and \(f_{ cost }\), respectively:

$$\begin{aligned}{f_{app}}(I,{\rm{ }}A,\pi ) &= \sum\limits_{A \in {\rm{ }}A} | A \cap \pi |, \\ {f_{\cos t}}(I,{\rm{ }}A,\pi ) &= \sum\limits_{A \in {\rm{ }}A} c (A \cap \pi ).\end{aligned}$$

It should be clear that these two rules are monotonic argmax rule (since the score they use is additive) and thus both satisfy weak reinforcement. Note also that these two rules are exhaustive.

Following an idea developed by [14] for scoring rules (in the standard voting framework), we introduce the noise model \(\mathcal {M}_{ app }\). It is defined such that for any instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\), approval ballot \(A \subseteq \mathcal {P}\), and ground truth \(\pi ^\star \in \mathcal {A}(I)\):

$${P_{{M_{app}}}}(A\mid {\pi ^ \star },I) = {1 \over {Z_{{\pi ^ \star }}^{app}}}\prod\limits_{p \in {\cal P}} {{2^{{1_{p \in A \cap {\pi ^ \star }}}}}} = {1 \over {Z_{{\pi ^ \star }}^{app}}}{2^{|A \cap {\pi ^ \star }|}},$$

where \(Z_{\pi ^\star }^{ app }\) is a suitable normalisation factor.

\(\mathcal {M}_{ app }\) is a particularly simple manifestation of what we would expect to see in a noise model: any possible ballot might be generated in principle, but the probability of generating ballot A increases exponentially with the size of the intersection between A and the ground truth.

With this noise model, maximising the likelihood may appear to have the same effect as maximising the approval score of a budget allocation. It could then be the case that the approval maximising rule is the MLE of \(\mathcal {M}_{ app }\). However, for this to hold, one has to have a closer look at the normalisation factor.

Lemma 11

For the noise model \(\mathcal {M}_{ app }\) to be a well-defined probability distribution, it must be the case that:

$$\begin{aligned} Z_{\pi ^\star }^{ app } = 2^{|\mathcal {P}|} \left( \frac{3}{2}\right) ^{|\pi ^\star |}. \end{aligned}$$

Proof

Consider any instance \(I = \left\langle \mathcal {P}, c, b \right\rangle\). Let \(A \subseteq \mathcal {P}\) be an approval ballot and \(\pi ^\star \in \mathcal {A}(I)\) a ground truth. For \(\mathcal {M}_{ app }\) to be a probability distribution, it should be the case that:

$$\begin{aligned} \sum _{A \subseteq \mathcal {P}} \mathbb {P}_{\mathcal {M}_{ app }}(A \mid \pi ^\star , I) = 1 \qquad \Longleftrightarrow \qquad Z_{\pi ^\star }^{ app } = \sum _{A \subseteq \mathcal {P}} 2^{|A \cap \pi ^\star |}. \end{aligned}$$

Let’s do some combinatorics. For \(k \in \{0, \ldots , |\pi ^\star |\}\), how many subsets of \(\mathcal {P}\) will intersect with \(\pi ^\star\) on exactly k projects? A suitable subset will consist of k projects from \(\pi ^\star\) that make up the intersection and any number \(j \in \{0, \ldots , |\mathcal {P}| - |\pi ^\star |\}\) of projects from \(\mathcal {P}\setminus \pi ^\star\) that do not have any impact on the intersection. Each such subset of projects contributes \(2^k\) to the value of \(Z_{\pi ^\star }^{ app }\). We thus have:

$$\begin{aligned} Z_{\pi ^\star }^{ app }&= \sum _{k = 0}^{|\pi ^\star |} 2^k \sum _{j = 0}^{|\mathcal {P}| - |\pi ^\star |} \left( {\begin{array}{c}|\pi ^\star |\\ k\end{array}}\right) \left( {\begin{array}{c}|\mathcal {P}| - |\pi ^\star |\\ j\end{array}}\right) \\&= \sum _{k = 0}^{|\pi ^\star |} \left( {\begin{array}{c}|\pi ^\star |\\ k\end{array}}\right) 2^k \sum _{j = 0}^{|\mathcal {P}| - |\pi ^\star |} \left( {\begin{array}{c}|\mathcal {P}| - |\pi ^\star |\\ j\end{array}}\right) \\&= 2^{|\mathcal {P}| - |\pi ^\star |} \sum _{k = 0}^{|\pi ^\star |} \left( {\begin{array}{c}|\pi ^\star |\\ k\end{array}}\right) 2^k \\&= 2^{|\mathcal {P}|} \left( \frac{3}{2}\right) ^{|\pi ^\star |}, \end{aligned}$$

where the last two lines are derived from the binomial expansion. \(\square\)

The normalisation factor of \(\mathcal {M}_{ app }\) thus depends on the ground truth, since not all feasible budget allocations have the same cardinality. We thus cannot conclude that the approval maximising rule is the MLE of this noise model.

Interestingly, this is not the case on unit-cost instances when considering exhaustive budget allocations.

Proposition 12

Under the assumption that the ground truth is exhaustive, both \(F_{ app }\) and \(F_{ cost }\) are the MLE of the noise model \(\mathcal {M}_{ app }\) for unit-cost instances.

Proof

Let I be a unit-cost instance. For any two exhaustive budget allocations \(\pi\) and \(\pi ' \in \mathcal {A}_{ EX }(I)\), by virtue of Lemma 11, we have \(Z_{\pi }^{ app } = Z_{\pi '}^{ app }\).

So, for any profile \(\varvec{A}\), we have:

$$\begin{aligned} \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} L_{\mathcal {M}_{ app }}(\varvec{A}, \pi , I)&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} \prod _{A \in \varvec{A}} \frac{1}{Z_{\pi }^{ app }} 2^{|A \cap \pi |} \\&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} 2^{\sum _{A \in \varvec{A}} |A \cap \pi |} \\&= \mathop {\textrm{argmax}}\limits _{\pi \in \mathcal {A}_{ EX }(I)} \sum _{A \in \varvec{A}} |A \cap \pi | \\&= F_{ app }(I, \varvec{A}). \end{aligned}$$

The last line follows from the fact that \(F_{ app }\) is exhaustive.

\(F_{ app }\) thus coincides with the MLE on I for the noise model \(\mathcal {M}_{ app }\). Moreover, since \(F_{ app }\) and \(F_{ cost }\) coincide on unit-cost instances, the result also applies to \(F_{ cost }\). \(\square\)

But this result is only half satisfactory. Can we find an impossibility result similar to the one we had for \(F^N_{ app }\) and \(F^N_{ cost }\)? It is actually easy to see that the proof we gave for Theorem 8 also works for both \(F_{ app }\) and \(F_{ cost }\).

Theorem 13

There is no noise model \(\mathcal {M}\) such that either \(F_{ app }\) or \(F_{ cost }\) is the MLE of \(\mathcal {M}\), not even on unit-cost instances.

Proof

Consider the instance I used in the proof of Theorem 8. We claim that for all profiles that are relevant for the proof, \(F_{ app }\) and \(F^N_{ app }\) coincide. We list them below.

$$\begin{aligned} F_{ app }(I, (\{p_1\}))= & \{\{p_1\}, \{p_1, p_2\}\} = F^N_{ app }(I, (\{p_1\})).\\ F_{ app }(I, (\{p_2\}))= & \{\{p_2\}, \{p_1, p_2\}\} = F^N_{ app }(I, (\{p_2\})).\\ F_{ app }(I, (\{p_1, p_2\}))= & \{\{p_1, p_2\}\} = F^N_{ app }(I, (\{p_1, p_2\})).\\ F_{ app }(I, (\emptyset ))= & \mathcal {A}(I) = F^N_{ app }(I, (\emptyset )). \end{aligned}$$

Given that on unit-cost instances \(F_{ app }\) and \(F_{ cost }\) coincide, this also applies to \(F_{ cost }\). \(\square\)

It is interesting to note that the same result applies if one maximises the total relative satisfaction as introduced by [22], i.e., the sum over all agents of the number (or cost) of approved and selected projects divided by the number (or cost) of approved projects. Indeed, one can easily check that for all relevant profiles this maximisation rule coincides with \(F^N_{ app }\). This trivially also applies when using the Nash social welfare with relative satisfaction.

As an aside, observe that the greedy cost approval rule \(F_{ greed }\) discussed in Sect. 2.2 coincides with \(F_{ app }\) on unit-cost instances. Thus, both Proposition 12 and Theorem 13 apply to \(F_{ greed }\) as well, and we can refine Proposition 2 as follows.

Corollary 14

Under the assumption that the ground truth is exhaustive, the greedy cost approval rule \(F_{ greed }\) is the MLE for \(\mathcal {M}_{ app }\) for unit-cost instances.

Moreover, for unconstrained ground truths, there is no noise model \(\mathcal {M}\) such that \(F_{ greed }\) is the MLE for \(\mathcal {M}\), not even on unit-cost instances.

These insights apply also for any refinements of these rules, such as the leximax rule introduced by [28].

4.2.2 Normalised satisfaction

Let us conclude our formal analysis by briefly mentioning the utilitarian social welfare with the normalised satisfaction functions. For the same reasons as for \(\tilde{F}^N_{ app }\) and \(\tilde{F}^N_{ cost }\), the corresponding utilitarian rules are not exhaustive. Analysing the epistemic status of these rules however turns out to be rather intricate, even on unit-cost instances. Indeed, it is less clear what a suitable noise model might look like, especially due to the complications related to the potential normalisation factor. Exploring these rules remains an interesting open problem.

Table 1 Summary of results: The sum \(\Sigma\) and product \(\Pi\) symbols represent the utilitarian and the Nash variant of a welfare-based rule

5 Conclusion

We have initiated the study of PB through the truth-tracking lens. For a total of eleven rules, we investigated whether they can be interpreted as MLEs. For those that cannot, we tried to identify specific conditions under which they still can serve as MLEs. All our results are summarised in Table 1.

There is still some work to be done regarding the study of MLEs in the context of PB. The most obvious task would be to answer the open questions corresponding to the two missing cells in Table 1. In the case of our positive results, one could try to come up with more natural noise models for which the rules are MLEs, e.g., noise models for which not all the supersets of the ground truth have the same probability. Our work also shows a certain tension between efficiency requirements (exhaustiveness) and truth-tracking ability: the two rules that we proved to be MLEs (for the general case) both fail exhaustiveness. This interaction deserves further study. Similarly, we found that the most studied rules that enforce proportional representation fail to satisfy weak reinforcement. It would be interesting to investigate whether or not this constitutes a formal incompatibility.

Then, on top of the MLE concept, the epistemic approach also offers several other ways of studying voting rules. As we have seen, finding rules that are MLEs is not an easy task. It might be the case that this is simply too demanding a requirement. Instead, other criteria that have been studied in the literature on epistemic social choice could be applied to the PB setting as well. For instance, it could be interesting to study PB rules with respect to their sample complexity [13] or their robustness against noise [10]—a criterion that is somewhat similar to the MLE requirement but easier to satisfy. All of these constitute interesting directions for future work on a topic that is still very much under-studied and deserving of further attention.