There are many ways to get bad results from pgbench, ones that don't mean anything valid. And there are even more ways to get results that vary so much from one pgbench run to another that they don't mean what you would imagine them to. Normally you should run any pgbench test at least three times and observe the average and variation before presuming that test is valid. Taking the middle of three results is a common median technique for filtering those results into a single one for comparison.
You must let the test run for a while to get useful results. As a general rule of thumb, if you haven't executed a minute's worth of runtime, you're unlikely to have gotten useful results at all. The main exception is the SELECT only test, which can give useful results in a small number of seconds. Generally, if you've seen...