Okay, so some things that come off immediately. We can reject with 99% confidence that the true mean of the population is .43 at basically every level of n except a couple. That's an odd finding, since we know that the true mean is .43. This isn't an indication of a broken shuffler with fat tails, this is an indication that the shuffler consistently prefers to put land toward the top of the deck - in other words a 26 land deck is closer in terms of its performance to a 30 land deck.
That's not just a problematic shuffler, that's a complete systematic failure.
I'm not trying to start trouble here, but are you sure the deck had 26 lands in it? And no ways to consistently draw additional lands. Because if not, the duels shuffler doesn't work at all. And I don't mean slightly broken, I mean completely and obviously broken.
If you hadn't told us that the deck had 26 lands in it I would reject that possibility with near certainty. The minimum number of lands I'd even believe (had you not explicitly told me otherwise) is 28, but the data indicates 30.
Mean:.4956765
SError:.0124345
99% CI: (.4619118 .5294411)
FYI, I clustered the data by cards drawn, as it was supposed to be, in case you want to find how I got the exact numbers - doing so increases the Confidence Interval, so if we don't think it's necessary, then the rejection is even stronger.
For the record, these are shocking results.
Let's be a little more clear here: this absolutely suggests that you would
always expect to draw too much land. It's not suggesting that low lands lead to few land draws, or high lands lead to high land draws - it's suggesting that no matter what you'll draw too many lands. It damn near suggests that mana screw (not flood) is extremely unlikely - should basically never happen.
Can you generate a few plots for the most common n's to visualize the duels behavior a bit better (i'm assuming in the raw data he sent you, he noted the amount of turns played per sample) ?