Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from PostgreSQL 11 Administration Cookbook Over 175 recipes for database administrators to manage enterprise databases

Product type Paperback

Published in May 2019

Publisher Packt

ISBN-13 9781789537581

Length 600 pages

Edition 1st Edition

Languages

SQL

Tools

PostgreSQL

Concepts

Databases

Authors (3):

Gianni Ciolli

Sudheer Kumar Meesala

Simon Riggs

View More author details

Table of Contents (14) Chapters

Preface

1. First Steps FREE CHAPTER

2. Exploring the Database

3. Configuration

4. Server Control

5. Tables and Data

6. Security

7. Database Administration

8. Monitoring and Diagnosis

9. Regular Maintenance

10. Performance and Concurrency

11. Backup and Recovery

12. Replication and Upgrades

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Identifying and removing duplicates

Relational databases work on the idea that items of data can be uniquely identified. However hard we try, there will always be bad data arriving from somewhere. This recipe shows you how to diagnose that and clean up the mess.

Getting ready

Let's start by looking at our example table, cust. It has a duplicate value in customerid:

postgres=# SELECT * FROM cust;
 customerid | firstname | lastname | age
------------+-----------+----------+-----
          1 | Philip    | Marlowe  |  38
          2 | Richard   | Hannay   |  42
          3 | Holly     | Martins  |  25
          4 | Harry     | Palmer   |  36
          4 | Mark      | Hall     |  47
(5 rows)

Before you delete duplicate data, remember...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (3)

Simon Riggs

Simon Riggs is the CTO of 2ndQuadrant, having contributed to PostgreSQL as a major developer and committer for 14 years. He has written and designed features for replication, performance, BI, management, and security. Under his guidance, 2ndQuadrant is now a leading developer of open source PostgreSQL, serving hundreds of clients in USA, Europe, and worldwide. Simon is a frequent speaker at many conferences on PostgreSQL Futures. He has worked as a database architect for 30 years.

See other products by Simon Riggs

GIANNI CIOLLI

Gianni Ciolli is the head of professional services at 2ndQuadrant and has been a PostgreSQL consultant, trainer, and speaker at many PostgreSQL conferences in Europe and abroad over the last 10 years. He has a PhD in Mathematics from the University of Florence. He has worked with free and open source software since the 1990s and is active in the community (the Prato Linux User Group and the Italian PostgreSQL Users Group). He lives in London with his son. His other interests include music, drama, poetry, and athletics.

See other products by GIANNI CIOLLI

Meesala

Sudheer Kumar Meesala is a lead architect at Endurance International Group and has spent the last few years designing and building scalable and secure web applications within finance and internet industries. A large part of his job has included decomposing monolithic legacy applications into microservices. This has required a deep understanding of PostgreSQL, Cassandra, and other NoSQL databases. Other key areas of interest are container orchestration, DevOps, and more. He is also an accomplished speaker and trainer. He lives in Bangalore, India, and spends far too much time in traffic jams.

See other products by Meesala