We study sequential shortest path interdiction, where in each period an interdictor with incomplete knowledge of the arc costs blocks at most [Formula: see text] arcs, and an evader with complete knowledge about the costs traverses a shortest path between two fixed nodes in the interdicted network. In each period, the interdictor, who aims at maximizing the evader’s cumulative cost over a finite time horizon, and whose initial knowledge is limited to valid lower and upper bounds on the costs, observes only the total cost of the path traversed by the evader, but not the path itself. This limited information feedback is then used by the interdictor to refine knowledge of the network’s costs, which should lead to better decisions. Different interdiction decisions lead to different responses by the evader and thus to different feedback. Focusing on minimizing the number of periods it takes a policy to recover a full information interdiction decision (that taken by an interdictor with complete knowledge about costs), we show that a class of greedy interdiction policies requires, in the worst case, an exponential number of periods to converge. Nonetheless, we show that under less stringent modes of feedback, convergence in polynomial time is possible. In particular, we consider different versions of imperfect randomized feedback that allow establishing polynomial expected convergence bounds. Finally, we also discuss a generalization of our approach for the case of a strategic evader, who does not necessarily follow a shortest path in each period.