Let agent i be the learning agent, with reservation prices
and
. Let
and
be agent i's projected buying
and selling prices of other agents.
Suppose agent i can be matched as a buyer according to its
reservation price, and is matched to seller j. By the matching
criterion, we must have
. From the
discussion before, we know that agent i wants to bid a buy price lower
than its own reservation buy price, i.e.
, and
indeed should bid as low as possible in order to maximize its payoff.
However, if it bids lower than
, it will lose the
chance
to trade with agent j.
Another way agent i might lose the opportunity to trade is if its
buy price is lower than the other potential buyers. Suppose that agent k
has the next highest bid other than agent i. If
, agent k
will get the match with agent j.
Therefore, agent i maximizes its reward by choosing
| (6) |
By similar analysis, suppose agent i can be matched as a seller to buyer j, and the next lowest seller is agent k, agent i should choose its sell-price such that
![]()
If agent i predicts that it cannot be matched either as a buyer or a seller according to its reservation prices, it simply bids its reservation prices.