Question: Problem 1 (Strict Complementarity in Linear Programming) (100 pts): In the lecture, we mentioned that there is strict complementarity property that holds for LP, without

lecture, we mentioned that there is strict complementarity property that holds for

LP, without giving a proof. We will prove this property in this

problem. Let us formally state this property first. Consider the following primal/dual

L.P problem: (P)minxRcxs.t.Ax=b(D)maxyRbyx0s.t.Ayc In the discussion that follows, we will assume both

Problem 1 (Strict Complementarity in Linear Programming) (100 pts): In the lecture, we mentioned that there is strict complementarity property that holds for LP, without giving a proof. We will prove this property in this problem. Let us formally state this property first. Consider the following primal/dual L.P problem: (P)minxRcxs.t.Ax=b(D)maxyRbyx0s.t.Ayc In the discussion that follows, we will assume both problems (P) and (D) are feasible. By weak duality. we know that both primal and dual optimal solutions exist. And by strong duality, for any pair of primal and dual optimal solutions (x,y), we have: cx=by. As an immediate consequence, complementarity slackness holds and we have: s=cAy0,(s)x=0 The above complementarity slackness can be equivalently written as the following: si>0xi=0,xi>0si=0,i=1,2,,n. In the complementarity slackness, it is possible that xi=si=0 for some 1in. However, for linear programming, in fact there exists a primal optimal solution x and a dual optimal solution s (and y ) such that the strict complementarity holds. That is, exactly one of xi or si equals 0 but not both, for all i=1,,n, or equivalently the following holds: si>0xi=0,xi>0si=0,i=1,2,,n. We formally state it as the following theorem: Theorem 1. Suppose both (P) and (D) are feasible. Then there exists a primal optimal solution x and a dual optimal solution (y,s) with s=cAy, such that the strict complementarify (1) holds. To prove Theorem 1, we will need to use the next Lemma, whose proof will be constructed in this problem step by step. Lemma 1. Suppose both (P) and (D) are feasible. Then for any index 1in, either there exists a primal optimal solution x such that xi>0, or there exists a dual optimal solution s such that si>0. Note that we can not have both xi>0 and si>0 for some i, otherwise the (weak) complementa be violated, which is supposed to hold for any optimal solution x and s. Answer the following questions in order to prove Lemma 1. (d) Following part (c) for the case when the optimal value of (P) is 0 , but consider now the case when the (an) optimal solution (y,y0) to (D) is such that y0=0. Let y be any optimal solution to (D) and define y=y+y. Answer the next three questions. - Show that y is feasible to (D). - Show that y is an optimal solution to (D). - In this case, we have s:=cAy is such that si>0. Explain why. (e) Finally, we are left with the case when the optimal value of (P) is less than 0 . This means that there is a feasible solution to (P) with negative objective value. Give an explanation of why this fact indicates that there exists an optimal solution x to the primal problem (P) such that xj>0. Note the following relations: 1n=1n.1TE=1.1A=(1)1n. maxcx=bminbys.t.Ax=0mins.by=Ayc0x0. xx1nnx1 ise x0, The interpretation of exch quantity of the primal problem is as following - Each peimal wariable x1 is the action frequency of action jA, or the expected present value of number of times in which an individual is in state 1 and takes action i. - b=1m means there is one individual in each state. Specifically P is a columa wochustic matrix, meaning each column taking action jt at state i). - Ax=1m is the contervatiog law, which ensures that for each itate 1 , the expected present value of mumber of individuals entering state i equals the expected present value of number of individuats leaving ig (EP)=b - The primal problem is to find action frequencies that maximize the expected present reward cx. Interpretation of the LP Formulation (Dual) Policy and Basic Feasible Solution A polic x= in the original MDP, containing exactly one action in di for each state, t. corresponds to m basic variable indices of a. base feasible colution of the primal problem in the L.P formulation. Let An,Pm,EnRm be the columng of A,P,E, with indices in rolution for policy $. Denote y= (o) y= be the remaining indices that are not in . The interpretation of each quantity of the dual problem is as following: - Each dual variable y assigns a value to each state i. is then x rend intoriale - The optimal dual variable y, satisfying A7yc, is the optimal walue function satisfying the Bellman ootimality equation. - Note that there exists a unique optimal value function v+=y4 such that y is the maximum-expected retum at state f. - The minimum value of the duat problem br y, is in fact the largest sum of the total state values for all possible pollicies . xyx=0,A2x0=1,x1 The corresponding dual basic solution y2 is the unique solution to Axy>C A17y=c4 and is 5 Teasible solution if. A17yc4) AYA=G X The primal dual basic solution pair (x,y) is optimal if and only if both are fegsible, In such case, = is an optimal policy and yn=v is the optimat value function. The duat constraints exactly describe the (matrox form) Bellman optimality equations. Policy and Basic Feasible Solution Lemma There is a one-to-one correspondence between a policy of the original MDP and a basic feasible solution of the primal problem (P). Primal and Dual Basic Solutions Given any policy , which is also a basis of (P), we can compute An=1{Pn and the primal/dual basic solution pair x and (y,s) as: y2=(A2)1c As we have seen in the dual simplex method, the primal/dual basic solution pair is always complementary. Here, the primal basic solution xn is always feasible (xn0), indicating dual optimality. On the other hand, the primal optimality is only achieved at dual feasibility, In particular, when sy0. Policy and Basic Feasible Solution (Proof) Proof The logic is to show that every basic feasible solution of (P) must contain one index in each of state set Ai,i=1,,m, which corresponds to one action in each state, thus a policy. Let be the basis set of any basic feasible solution for (P). Consider the coefficients of the i-th row of A. We can see that ay0 for all j/Ai. If we assume for some i, no basic variable is chosen from Ai, of simply Ai=, then we have: which is a contradiction. That is, there must be at least one basic variable in each A,i=1,,m. However, since is a basis, =m. Then must contain exactly one (action) index in Ai for each state i=1,,m. In other words, is a policy. Strict Complementarity Lemma If both LP(P) and (D) are feasible, then there is a unique partition B{1,2,,n} and N{1,2,,n}, such that for each optimal solution pair (x,s). x^j=0,vjN1andsj=0,vjB, and there is at lesst one optimal solution pair (x,s) that is strictly complementary: xj>0,VjB1,andsj>0,VjN. In particular, every optimal policy B so that Bm and Nnm. The existence of strictly complementary optimal solution follow from the strict complementarity result of general L.P. We may call B the optimal (super) basic variable set and N the optimal non-basic variable set. The cardinality result follows from the fact that there must be an optimal basic feasible solution (or optimal policy) where the basic variables (optimal action frequencies) are all strictly positive, so their indices must belong to B. There may exist multiple optimal policies for an MDP, B contains actions, each of which appears in at least one optimal policy, and N contains the remaining actions, none of which appears in any optimal policy. The total number of non-optimal actions in N should not exceed nm for any MDP. The optimal dual basic feasible solution pair (y,5) is unique and invariant among the multiple optimal policies. Therefore, if j is a non-optimal action, then its optimal dual slack value s j must be positive. The LP Formulation (Basic/Non-Basic) maxs.t.cxAx=bx0,minbys.t.s=Ayc0 Given a policy and let denote the remaining indices not in . the primal problem can be expressed in the equivalent form: where c is the reduced cost and cn=0,cr=cyAyr and yn=(An+)1cn The LP Formulation (Basic/Non-Basic) Given a policy and let y denote the remaining indices not in , the primal problem can be expressed in the equivalent form: maxixi+cx0maxci(Am)1m+CyxnArx2+Axxn=1s.t.Axxr+Axr=1mx=(xr;x)0 The dual problem can be written as min15ys.A4y+s4=c4A1y+s2=c2s=(5,;,sn)0.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related General Management Questions!

s sf Define the terms opaque type and concrete type. [5 marks] The following is a shortened version of one of the definition modules described in the Modula-2 user manual: Provide a suitable...

write report MAnAgEMENT ToOLS \& PRACTICE-ReadingS, PART 1 Sources: Robert C. Appleby (1991): Modern Business Administration. Pitman \& Cole, G. A. (1993): Management Theory and Practice. DP...

Portray in words what transforms you would have to make to your execution to some degree (a) to accomplish this and remark on the benefits and detriments of this thought.You are approached to compose...

Suppose that R(A, B, C) is a relational schema with functional dependencies F = {A, B C, C B}. (i) Is this schema in 3NF? Explain. [2 marks] (ii) Is this schema in BCNF? Explain. [2 marks] (b)...

(a) In SystemVerilog, what is the difference between: (i) The ternary operator ? and if...then...else statements? [2 marks] (ii) always_ff and always_comb? [2 marks] (iii) Blocking, non-blocking and...

Prolog You are approached to compose a Prolog program to work with twofold trees. Your code shouldn't depend on any library predicates and you ought to expect that the mediator is running without...

Sanford Law Firm, PLLC, Little Rock, by: Vanessa Kinney, for appellant. Kutak Rock LLP, Little Rock, by: David L. Williams and Katherine M. Hingtgen, for appellee. The Arkansas Court of Appeals...

Problem 1 . Determine the returns to scale for the following production functions. Give a mathematical proof of your answer. ( A ) F ( A , K , L ) = A K 1 6 L 1 6 ( C ) F ( A , K , L ) = K 2 + A L 2...

Please summarize this article, the length of summary should not be more than 2 pages with 1.5 spacing , size 12 Times New Rome Journal of International Money and Finance 17 1998. 535545 The use of...

Employees who have managers with Theory Y attitudes are just as unhappy with their jobs as employees who have managers with Theory X attitudes. T/F

During the current year, Horace's personal residence is damaged by a tornado. The residence had an adjusted basis of $80,000 before the tornado. The cost of repairing the damage is $30,000. Horace's...

Which of the following statements is true regarding impairment lonses on intageble assets in use? After the impairment loss is recorded, the anset will be reported at net realtrable value. The asset...

Premise: Using a dictionary and list, create a set of key / value pairs that consists of a musical database. The key / value pairs should consist of Artist Name: Song Names pattern. For example, EACH...