
Tech News

3711 Articles

Startup Focus: Sea Machines Winning Contracts for Autonomous Marine Systems from AI Trends

Matthew Emerick
15 Oct 2020
8 min read
By AI Trends Staff

The ability to add automation to an existing marine vessel to make it autonomous is here today, and it is being proven by a Boston company. Sea Machines builds autonomous vessel software and systems for the marine industry. Founded in 2015, the company recently raised $15 million in a Series B round, bringing its total raised since 2017 to $27.5 million.

Founder and CEO Michael G. Johnson, a licensed marine engineer, recently took the time to answer via email some questions AI Trends poses to selected startups.

Describe your team, the key people.

Sea Machines is led by a team of mariners, engineers, coders and autonomy scientists. The company today has a crew of 30 people based in Boston; Hamburg, Germany; and Esbjerg, Denmark. Sea Machines is also hiring for a variety of positions, which can be viewed at sea-machines.com/careers.

Michael Johnson, Founder and CEO, Sea Machines

What business problem are you trying to solve?

The global maritime industry is responsible for billions in economic output and is a major driver of jobs and commerce. Despite the sector's success and endurance, it faces significant challenges that can negatively impact operator safety, performance and profitability. Sea Machines is solving many of these challenges by developing technologies that are helping the marine industry transition into a new era of task-driven, computer-guided vessel operations.

How does your solution address the problem?

Autonomous systems solve these challenges in several ways:

- Autonomous grid- and waypoint-following capabilities relieve mariners from manually executing planned routes. Today's autonomous systems uniquely execute with human-like behavior, intelligently factoring in environmental and sea conditions (including wave height, pitch, heave and roll); changing speeds between waypoints; and actively detecting obstacles for collision avoidance.
- Autonomous marine systems also enable optionally manned or autonomous-assist (reduced-crew) modes that can reduce mission delays and maximize effort. This is an important feature for anyone performing time-sensitive operations, such as on-water search-and-rescue or other urgent missions.
- Autonomous marine systems offer obstacle detection and collision avoidance capabilities that keep people and assets safe and out of harm's way. These advanced technologies are much more reliable and accurate than the human eye, especially in low light or poor sea conditions. Because today's systems enable remote-helm control and remote payload management, there is a reduced need for mariners (such as marine fire or spill response crews) to physically man a vessel in a dangerous environment. A remote-helm control beltpack also improves visibility by enabling mariners to step outside the wheelhouse to whatever location provides the best vantage point when performing tight maneuvers, dockings and other precision operations.
- Autonomous marine systems enable situational awareness with multiple cameras and sensors streaming live over a 4G connection. This real-time data gives shoreside or at-sea operators a full view of an autonomous vessel's environment, threats and opportunities.
- Minimally manned vessels can autonomously collaborate to cover more ground with fewer resources, creating a force-multiplier effect. A single shoreside operator can command multiple autonomous boats with full situational awareness.

These areas of value overlap across all sectors, but for the government and military sector, new on-water capabilities and unmanned vessels are a leading driver. By contrast, the commercial sector is looking for increased productivity, efficiency and predictable operations. Our systems meet all of these needs. Our technology is designed to be installed on new vessels as well as existing ones. Sea Machines' ability to upgrade existing fleets greatly reduces the time and cost of leveraging the value of our autonomous systems.

How are you getting to the market? Is there competition?

Sea Machines has an established dealer program to support the company's global sales across key commercial marine markets. The program includes many strategic partners who are enabled to sell, install and service the company's line of intelligent command and control systems for workboats. To date, Sea Machines dealers are located across the US and Canada, in Europe, in Singapore and in the UAE. We have competition for autonomous marine systems, but our products are the only ones that are retrofit-ready, not requiring new vessels to be built.

Do you have any users or customers?

Yes, we have achieved significant sales traction since launching our SM series of products in 2018. Just since the summer, Sea Machines has been awarded several significant contracts and partnerships:

- The first allowed us to begin serving the survey vessel market with the first announced collaboration with DEEP BV in the Netherlands. DEEP's vessel outfitted with the SM300 entered survey service very recently.
- Next, we partnered with Castine-based Maine Maritime Academy (MMA) and representatives of the U.S. Maritime Administration (MARAD)'s Maritime Environmental and Technical Assistance (META) Program to bring valuable, hands-on education about autonomous marine systems into the MMA curriculum.
- Then we recently announced a partnership with shipbuilder Metal Shark Boats, of Jeanerette, Louisiana, to supply the U.S. Coast Guard (USCG)'s Research and Development Center (RDC) with a new Sharktech 29 Defiant vessel for the purposes of testing and evaluating the capabilities of available autonomous vessel technology. USCG demonstrations are happening now (through November 5) off the coast of Hawaii.
- Finally, just this month, we announced that the U.S. Department of Defense (DOD)'s Defense Innovation Unit (DIU) awarded us a multi-year Other Transaction (OT) agreement. The primary purpose of the agreement is to initiate a prototype that will enable commercial ocean-service barges to act as autonomous Forward Arming and Refueling Point (FARP) units for an Amphibious Maritime Projection Platform (AMPP). Specifically, Sea Machines will engineer, build and demonstrate ready-to-deploy system kits that enable autonomous, self-propelled operation of opportunistically available barges to land and replenish military aircraft.

In the second half of 2020 we are also commencing onboard collaborations with some crew-transfer vessel (CTV) operators serving the wind farm industry.

How is the company funded?

The company recently completed a successful Series B round, which provided $15M in funds, for a total of $27.5M raised since 2017. The most recent funds will significantly impact Sea Machines, and therefore the maritime and marine industries as a whole. The funds will be put to use to further strengthen our technical development team, build out our next level of systems manufacturing, and scale our operations group to support customer deployments. We will also be investing in supporting technologies to speed our course to full dock-to-dock, over-the-horizon autonomy. The purpose of our technology is to optimize vessel operations with increased performance, productivity, predictability and, ultimately, safety.

In closing, we'd like to add that the marine industries are a critically significant component of the global economy, and it's up to us to keep them strong and relevant. Along with people, processes and capital, pressing the bounds of technology is a key driver. The world is being revolutionized by intelligent, autonomous self-piloting technology, and today we find ourselves just beyond the starting line of a busy road to broad adoption across all marine sectors. If Sea Machines continues to chart the course with forward-looking pertinence, you will see us rise to become one of the most significant companies and brands serving the industry in the 21st century.

Any anecdotes/stories?

This month we released software version 1.7 on our SM300. That's seven significant updates in just over 18 months, each one providing increased technical hardening and new features for specific workboat sectors.

Another interesting story is about our Series B funding, which, due to the pandemic, we raised virtually. Because of where we are as a company, we have been proving our ability to retool the marine industry with our technology, and therefore we are delivering confidence to investors. We were forced to conduct the entire process by video conference, which may have increased the overall efficiency of the raise, as these rounds traditionally require thousands if not tens of thousands of miles of travel for face-to-face meetings, diligence and handshakes. Remote pitches also proved to be an advantage because they allowed us to showcase our technology in a more direct way. We did online demos where we had our team remotely connected to our vessels off Boston Harbor. We were able to get the investors into the captain's chair, as if they were remotely commanding a vessel in real-world operations.

Finally, in January, we announced the receipt of ABS and USCG approval for our SM200 wireless helm and control systems on a major class of U.S.-flag articulated tug-barges (ATBs). The first unit has been installed and is in operation, and we look forward to announcing details around it. We will be taking the SM200 forward into the type-approval process.

Learn more at Sea Machines.


Web Applications are Focus of Cybercrime Gangs in Data Breaches, Report Finds from AI Trends

Matthew Emerick
15 Oct 2020
7 min read
By John P. Desmond, AI Trends Editor

Web applications are the primary focus of many cybercrime gangs engaged in data breaches, a primary security concern for retailers, according to the 2020 Data Breach Investigations Report (DBIR) recently released by Verizon, the 13th edition of the report. Verizon analyzed a total of 157,525 incidents, of which 3,950 were confirmed data breaches.

"These data breaches are the most serious type of incident retailers face. Such breaches generally result in the loss of customer data, including, in the worst cases, payment data and log-in and password combinations," stated Ido Safruti, co-founder and chief technology officer of PerimeterX, a provider of security services for websites, in an account in Digital Commerce 360.

Among the report's highlights:

Misconfiguration errors, resulting from failure to implement all security controls, top the list of the fastest-growing risks to web applications. Across all industries, misconfiguration errors increased from below 20 percent in the 2017 survey to over 40 percent in the 2020 survey. "The reason for this is simple," Safruti stated. "Web applications are growing more and more complex. What were formerly websites are now full-blown applications made up of dozens of components and leveraging multiple external services."

Ido Safruti, co-founder and chief technology officer, PerimeterX

External code can typically comprise 70 percent or more of a web application, much of it JavaScript calls to external libraries and services. "A misconfigured service or setting for any piece of a web application offers a path to compromise the application and skim sensitive customer data," Safruti stated. Cybercriminal gangs work to exploit rapid changes on web applications, as development teams build and ship new code faster and faster, often tapping third-party libraries and services.
Weaknesses in version control and in monitoring web applications for unauthorized introductions of code are vulnerabilities. Magecart attacks, from a consortium of malicious hacker groups who target online shopping cart systems, especially on large ecommerce sites, insert rogue elements as components of web applications with the goal of stealing shoppers' credit card data. "Retailers should consider advanced technology using automated and audited processes to manage configuration changes," Safruti advises.

Vulnerabilities are not patched quickly enough, leaving holes for attackers to exploit. Only half of vulnerabilities are patched within three months of discovery, the 2020 DBIR report found. These attacks offer hackers the potential of large amounts of valuable customer information for the least amount of effort.

Attacks against web application servers made up nearly 75% of breached assets in 2019, up from roughly 50% in 2017, the DBIR report found. Organized crime groups undertook roughly two-thirds of breaches, and 86% of breaches were financially motivated. The global average cost of a data breach is $3.92 million, with an average of over $8 million in the United States, according to a 2019 study from the Ponemon Institute, a research center focused on privacy, data protection and information security.

Another analysis of the 2020 DBIR report found that hacking and social attacks have leapfrogged malware as the top attack tactic. "Sophisticated malware is no longer necessary to perform an attack," stated the report in SecurityBoulevard. Developers and QA engineers who develop and test web applications would benefit from the use of automated security testing tools and security processes that integrate with their workflow. "We believe developers and DevOps personnel are one of the weakest links in the chain and would benefit the most from remediation techniques," the authors stated.
Credential Stuffing Attacks Exploit Users With the Same Password Across Sites

Credential stuffing is a cyberattack in which lists of stolen usernames and/or email addresses are used to gain unauthorized access to user accounts through large-scale automated login requests directed against a web application. "Threat actors are always conducting credential stuffing attacks," found a "deep dive" analysis of the 2020 DBIR report from SpyCloud, a security firm focused on preventing online fraud.

The SpyCloud researchers advise users never to reuse passwords across online accounts. "Password reuse is a major factor in credential stuffing attacks," the authors state. They advise using a password manager and storing a unique, complex password for each account. The 2020 DBIR report found this year's top malware variant to be password dumpers, malware that extracts passwords from infected systems. This malware is aimed at acquiring credentials stored on target computers, or involves keyloggers that acquire credentials as users enter them.

Some 22 percent of breaches found were the result of social attacks, which are cyberattacks that involve social engineering and phishing. Phishing, making fake websites, emails, text messages and social media messages to impersonate trusted entities, is still a major way that sensitive authentication credentials are acquired illicitly, SpyCloud researchers found. Average consumers are each paying more than $290 in out-of-pocket costs and spending 16 hours to resolve the effects of this data loss and the resultant account takeover, SpyCloud found.

Businesses Increasing Investment in AI for Cybersecurity, Capgemini Finds

To defend against the new generation of cyberattacks, businesses are increasing their investment in AI systems to help. Two-thirds of organizations surveyed by Capgemini Research last year said they will not be able to respond to critical threats without AI.
Capgemini surveyed 850 senior IT executives from IT information security, cybersecurity and IT operations across 10 countries and seven business sectors. Among the highlights was that AI-enabled cybersecurity is now an imperative: over half (56%) of executives say their cybersecurity analysts are overwhelmed by the vast array of data points they need to monitor to detect and prevent intrusion. In addition, the types of cyberattacks that require immediate intervention, or that cannot be remediated quickly enough by cyber analysts, have notably increased, including:

- Cyberattacks affecting time-sensitive applications (42% said these had gone up, by an average of 16%).
- Automated, machine-speed attacks that mutate at a pace that cannot be neutralized through traditional response systems (43% reported an increase, by an average of 15%).

Executives interviewed cited benefits of using AI in cybersecurity:

- 64% said it lowers the cost of detecting breaches and responding to them, by an average of 12%.
- 74% said it enables a faster response time, reducing the time taken to detect threats, remedy breaches and implement patches by 12%.
- 69% said AI improves the accuracy of detecting breaches, and 60% said it increases the efficiency of cybersecurity analysts, reducing the time they spend analyzing false positives and improving productivity.

Budgets for AI in cybersecurity are projected to rise: almost half (48%) of respondents said they are planning 29 percent increases in FY2020; some 73 percent were testing use cases for AI in cybersecurity; only one in five organizations reported using AI in cybersecurity before 2019. "AI offers huge opportunities for cybersecurity," stated Oliver Scherer, CISO of Europe's leading consumer electronics retailer, MediaMarktSaturn Retail Group, in the Capgemini report.
"This is because you move from detection, manual reaction and remediation towards automated remediation, which organizations would like to achieve in the next three or five years."

Geert van der Linden, Cybersecurity Business Lead, Capgemini Group

Barriers remain, including a lack of understanding of how to scale use cases from proof of concept to full-scale deployment. "Organizations are facing an unparalleled volume and complexity of cyber threats and have woken up to the importance of AI as the first line of defense," stated Geert van der Linden, Cybersecurity Business Lead at Capgemini Group. "As cybersecurity analysts are overwhelmed, with close to a quarter of them declaring they are not able to successfully investigate all identified incidents, it is critical for organizations to increase investment and focus on the business benefits that AI can bring in terms of bolstering their cybersecurity."

Read the source articles in the 2020 Data Breach Investigations Report from Verizon, in Digital Commerce 360, in SecurityBoulevard, from SpyCloud and from Capgemini Research.
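As an aside, the credential-stuffing pattern described earlier, large volumes of automated login requests fired at a web application, is commonly blunted by per-account rate limiting. The sketch below is illustrative only; the class name and thresholds are invented for this example and are not drawn from any of the cited reports.

```python
import time
from collections import defaultdict, deque


class LoginRateLimiter:
    """Reject further login attempts for an account once attempts
    exceed a threshold within a sliding time window -- a basic
    mitigation against automated credential-stuffing traffic."""

    def __init__(self, max_attempts=5, window_seconds=60):
        self.max_attempts = max_attempts
        self.window = window_seconds
        self.attempts = defaultdict(deque)  # username -> attempt timestamps

    def allow(self, username, now=None):
        now = time.monotonic() if now is None else now
        q = self.attempts[username]
        # Discard attempts that have aged out of the window.
        while q and now - q[0] > self.window:
            q.popleft()
        if len(q) >= self.max_attempts:
            # Likely automated: block, or escalate to CAPTCHA/MFA.
            return False
        q.append(now)
        return True
```

A real deployment would also throttle by source IP, keep counters in a shared store so all application servers see the same totals, and combine rate limiting with the password-hygiene advice above.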


The best Google Cloud courses to future-proof your resume from Android Development – Android Authority

Matthew Emerick
15 Oct 2020
4 min read
Google Cloud certification demonstrates proficiency with the Google Cloud Platform (GCP). The best Google Cloud courses prepare you for these exams so you can add the certification to your resume quickly and on your first try.

The Google Cloud Platform is a suite of tools hosted on the cloud that businesses can use to enhance their own services. These include cloud storage, security, user profiles, machine learning, database management, IoT, and more. With Google Cloud certification, IT professionals can open doors in their careers and potentially command a higher salary.

See also: What is Google Cloud certification and should I get it?

Gaining certification is simple. Google offers a total of seven certificates. Each represents an understanding of specific tools offered by the platform as they apply to respective careers. Before sitting any of these exams, it's important that professionals gain sufficient knowledge of the technologies they are being tested on. This will ensure a passing grade and avoid resits. Below, you will find some of the best Google Cloud certification courses that will prepare you for these exams.

Editor's note: We'll be adding to this list over time as more Google Cloud courses become available.

The best Google Cloud courses

The Complete Google Cloud Mastery Bundle

With seven different certifications and a huge roster of different skills and tools, it can be tough to know where to start with your Google Cloud certification. That's what makes this Complete Google Cloud Mastery Bundle such a great option. It provides eight separate learning kits, which include specific training relating to each of the exams, as well as a separate exam preparation "boot camp." The course would normally cost $1,392, but Android Authority readers can get the full bundle for just $39. This really is one of the best Google Cloud courses you can find right now, at an excellent price.
Why you should buy:
- Includes an exam preparation boot camp
- Amazing value
- Information for every type of professional

Why you should pass:
- Some information may not be relevant for certain professionals
- Exam boot camp is for 2019

The Complete 2020 Cloud Foundation Certification Bundle

Credit: Adam Sinicki / Android Authority

We've previously discussed the benefits of Google Cloud Platform certification versus Microsoft Azure and Amazon Web Services, but ultimately the best option is to get them all. If you're unsure of where to start, then this introductory course that teaches the basics of all three is perfect. You'll get lifetime access to four separate courses containing 160 lessons. By the end, you'll be ready to sit multiple exams (including CompTIA Cloud+).

Why you should buy:
- Perfect for well-rounded knowledge
- Prepares you for multiple exams
- A large amount of content

Why you should pass:
- Not necessary if you only want Google Cloud certification

The Google Cloud Certifications Practice Tests + Courses Bundle

Credit: Adam Sinicki / Android Authority

This is another extremely comprehensive option that includes training for all seven tests as well as practice questions. Practice tests are a great thing to look for when choosing Google Cloud courses, as they can build confidence and ensure you are ready. This course also represents fantastic value at just $29.99 (down from $639).
Why you should buy:
- Preparation for all seven exams
- Includes practice questions

Why you should pass:
- May have more content than you need

GCP: Complete Google Data Engineer and Cloud Architect Guide

This course provides access to 166 lectures and 22 hours of content, all squarely aimed at the Data Engineer and Cloud Architect certifications. This focus makes the course a great choice for anyone who knows these are their areas of interest. It also gives you a large amount of highly focused content, which will prepare you for either the Google Data Engineer or Cloud Architect certification exams. Once again, Android Authority readers can get a large discount. The course is usually valued at $199, but if you act now, you can get the whole thing for just $15.

Why you should buy:
- A large amount of focused content
- Data science and cloud technologies are up-and-coming areas
- Specific exam preparation

Why you should pass:
- Only relevant for the two mentioned certificates


AI Autonomous Cars Might Have Just A Four-Year Endurance Lifecycle from AI Trends

Matthew Emerick
15 Oct 2020
14 min read
By Lance Eliot, the AI Trends Insider

After AI autonomous self-driving cars have been abundantly fielded onto our roadways, one intriguing question that has so far gotten scant attention is how long those self-driving cars will last.

It is easy to simply assume that the endurance of a self-driving car will be the same as that of today's conventional cars, especially since most self-driving cars currently make use of a conventional car rather than a special-purpose-built vehicle.

But there is something about self-driving cars that perhaps does not immediately meet the eye: they are likely to accumulate a lot of miles in a short period. Given that the AI is doing the driving, there is no longer a damper on the number of miles a car might be driven in any given period, a limit usually set by the availability of a human driver. Instead, the AI is a 24x7 driver that can be used non-stop, leveraging the self-driving car as a continuously moving and available ride-sharing vehicle.

With all that mileage, the number of years of endurance will be lessened in comparison to a comparable conventional car that is driven only intermittently. You could say that the car is still the car; the difference is that it might get as many miles of use in a much shorter period of time and thus reach its end-of-life sooner (though nonetheless still racking up the same total number of miles).

Some automakers have speculated that self-driving cars might only last about four years. This comes as quite a shocking revelation: AI-based autonomous cars might merely be usable for a scant four years at a time and then presumably end up on the scrap heap.

Let's unpack the matter and explore the ramifications of a presumed four-year life span for self-driving cars.
For my framework about AI autonomous cars, see the link here: https://aitrends.com/ai-insider/framework-ai-self-driving-driverless-cars-big-picture/

Why this is a moonshot effort, see my explanation here: https://aitrends.com/ai-insider/self-driving-car-mother-ai-projects-moonshot/

For more about the levels as a type of Richter scale, see my discussion here: https://aitrends.com/ai-insider/richter-scale-levels-self-driving-cars/

For the argument about bifurcating the levels, see my explanation here: https://aitrends.com/ai-insider/reframing-ai-levels-for-self-driving-cars-bifurcation-of-autonomy/

Life Span Of Cars

According to various stats about today's cars, the average age of a conventional car in the United States is estimated at 11.6 years.

Some tend to use the 11.6 years, or a rounded 12 years, as a surrogate for how long a car lasts in the U.S., though this is somewhat problematic since the average age is not the endpoint of a car and encapsulates a range of car ages, including a slew of cars that were retired at a much younger age and those that hang on to a much older age.

Indeed, one of the fastest-growing segments of car ages is the group that is 16 years or older, amounting to an estimated 81 million such cars by the year 2021. Of those 81 million cars, around one-fourth will be more than 25 years old. In short, cars are being kept around longer and longer.

When you buy a new car, the rule of thumb often quoted by automakers is that the car should last about 8 years or 150,000 miles. This is obviously a low-ball kind of posturing, trying to set expectations so that car buyers will be pleased if their cars last longer. One supposes it also perhaps gets buyers into the mental mode of considering buying their next car in about eight years or so.

Continuing the effort to consider various stats about cars, Americans drive their cars about 11,000 miles per year.
If a new car is supposed to last for 150,000 miles, the math suggests that at 11,000 miles per year you could drive the car for roughly 14 years (150,000 miles divided by 11,000 miles per year).

Of course, the average everyday driver uses their car for easy driving, such as commuting to work and driving to the grocery store. Generally, you wouldn't expect the average driver to be putting many miles onto a car. What about those pushing their cars to the limit and driving in a much harsher manner?

Various published stats about ride-sharing drivers, such as those for Uber and Lyft, suggest that they amass about 1,000 miles per week on their cars. If so, the number of miles per year would be approximately 50,000. At a pace of 50,000 miles per year, these on-the-go cars would presumably only last about 3 years, based on the math of 150,000 miles divided by 50,000 miles per year. In theory, this implies that a ride-sharing car being used today will perhaps last about 3 years.

For self-driving cars, most would agree that a driverless car will be used in a similar ride-sharing manner and be on the road quite a lot. This seems sensible. To make as much money as possible with a driverless car, you would likely seek to maximize its use. Put it onto a ride-sharing network and let it be used as much as people are willing to book it and pay to use it.

Without the cost and hassle of having to find and use a human driver, the AI will presumably be willing to drive a car whenever and for however long is needed. As such, a true self-driving car is being touted as likely to be running 24x7.

In reality, you can't have a self-driving car that is always roaming around, since there needs to be time set aside for ongoing maintenance of the car, along with repairs, and some amount of time for fueling or recharging.
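The back-of-the-envelope service-life arithmetic above can be captured in a few lines. The 150,000-mile lifetime and the two annual mileages are the article's own illustrative figures, not independent data:

```python
# Figures quoted in the article: a new car is expected to last about
# 150,000 miles; an average driver logs ~11,000 miles/year, while a
# heavily used ride-sharing (or autonomous) vehicle may log ~50,000.
LIFETIME_MILES = 150_000


def service_years(annual_miles: float) -> float:
    """Years until the expected mileage limit is reached."""
    return LIFETIME_MILES / annual_miles


print(round(service_years(11_000), 1))  # average driver: ~13.6 years
print(service_years(50_000))            # ride-share duty: 3.0 years
```

The same division explains the headline claim: the harder the duty cycle, the sooner the vehicle hits its mileage ceiling, even though the total miles delivered are unchanged.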
Overall, it seems logical to postulate that a self-driving car will be used at least as much as today's human-driven ride-sharing cars, and likely a lot more, since the self-driving car is not limited by human driving constraints. In short, if today's ride-sharing cars are hitting their boundaries at perhaps three to five years, you could reasonably extend that same thinking to driverless cars and assume that self-driving cars might only last about four years.

The shock that a driverless car might only last four years is not quite as surprising when you consider that a true self-driving car will be pushed to its limits in terms of usage and be a ride-sharing goldmine (presumably) that undergoes nearly continual driving time.

For why remote piloting or operating of self-driving cars is generally eschewed, see my explanation here: https://aitrends.com/ai-insider/remote-piloting-is-a-self-driving-car-crutch/

To be wary of fake news about self-driving cars, see my tips here: https://aitrends.com/ai-insider/ai-fake-news-about-self-driving-cars/

The ethical implications of AI driving systems are significant; see my indication here: http://aitrends.com/selfdrivingcars/ethically-ambiguous-self-driving-cars/

Be aware of the pitfalls of normalization of deviance when it comes to self-driving cars; here's my call to arms: https://aitrends.com/ai-insider/normalization-of-deviance-endangers-ai-self-driving-cars/

Factors Of Car Aging

Three key factors determine how long a car will last:

- How the car was built
- How the car is used
- How the car is maintained

Let's consider how those key factors apply to self-driving cars.

In the case of today's early versions of what are intended to be driverless cars, by and large most automakers are using a conventional car as the basis for their driverless car, rather than building an entirely new kind of car.
We will eventually see entirely new kinds of cars being made to fully leverage driverless capability, but for right now it is easier and more expedient to use a conventional car as the cornerstone for an autonomous car. Therefore, for the foreseeable future, we can assume that the manner in which a driverless car is built is in keeping with how a conventional car is built, implying that the car itself will last as long as a conventional car might last.

In terms of car usage, as already mentioned, a driverless car is going to get far more use than the car of an average everyday driver, and at least as much as today's ridesharing cars. The usage is bound to be much higher.

The ongoing maintenance of a self-driving car will become vital to its owner. I say this because any shortcomings in maintenance would tend to mean that the driverless car is in the shop and not available on the streets. The revenue stream from an always-on self-driving car will be a compelling reason for owners to make sure their car gets the proper amount of maintenance. In that sense, the odds are that a driverless car will likely be better maintained than either an average everyday car or even today's ridesharing cars.

One additional element to consider for driverless cars consists of the add-ons for sensory capabilities and computer processing. Those sensory devices, such as cameras, radar, ultrasonic, LIDAR, and so on, need to be factored into the longevity of the overall car, and the same applies to the on-board computer chips and memory.

Why Retire A Car

The decision to retire a car is based on a trade-off between continuing to pour money into a car that is breaking down and excessively costing money to keep afloat, versus ditching the car and opting to get a new or newer car instead.
Thus, when you look at how long a car will last, you are also silently weighing the cost of a new or newer car. We don't yet know what a driverless car will cost. If the purchase price of a self-driving car is really high, owners would presumably have a greater incentive to keep a used self-driving car in sufficient working order.

There is also a safety element that comes into play in deciding whether to retire a self-driving car. Suppose a routinely maintained driverless car starts out as safe as a new one, but eventually maintenance can only achieve so much in ensuring that the driverless car remains as safe on the roadways as a new or newer self-driving car would be. The owner of the used self-driving car would then need to ascertain whether the safety degradation means the used driverless car needs to be retired.

Used Market For Self-Driving Cars

With conventional cars, an owner that first purchased a new car will likely sell it after a while. We all realize that a conventional car might end up being passed from one buyer to another over its lifespan. Will there be an equivalent market for used self-driving cars?

You might be inclined to immediately suggest that once a self-driving car is no longer safe enough, it needs to be retired. We don't yet know, and no one has established, what that safety juncture or threshold might be. There could be a used self-driving car market for driverless cars still within some bounds of being safe. Suppose an owner that had used their self-driving car extensively in a downtown city setting opted to sell the autonomous car to someone living in a suburban community. The logic might be that the self-driving car is no longer sufficient for a stop-and-go traffic environment but might be viable in a less stressful suburban locale.
Overall, no one is especially thinking about used self-driving cars, which is admittedly a concern far off in the future and therefore not a topic looming over us today.

Retirement Of A Self-Driving Car

Other than becoming a used car, what else might happen to a self-driving car after it's been in use for a while? Some have wondered whether it might be feasible to convert a self-driving car into a human-driven car, doing so to place it into the used market for human-driven cars. Well, it depends on how the self-driving car was originally made. If the self-driving car has all of the mechanical and electronic guts for human driving controls, you could presumably unplug the autonomy and revert the car into being human-driven. I would assert that this is very unlikely, and you won't see self-driving cars being transitioned into human-driven cars.

All told, it would seem that once a self-driving car has reached its end of life, the vehicle would be scrapped. If self-driving cars are being placed into the junk heap every four years, this raises the specter of a lot of car junk piling up. For environmentalists, this is certainly disconcerting.

Generally, today's cars are highly recyclable and reusable: estimates suggest that around 80% of a car can be recycled or reused. For driverless cars, assuming they are built like today's conventional cars, you would potentially be able to attain a similar recycled-and-reused-parts percentage. The add-on sensory devices and computer processors might be recyclable and reusable too, though this is not necessarily the case, depending upon how the components were made.
Conclusion

Some critics would be tempted to claim that the automakers would adore having self-driving cars that last only four years. Presumably, it would mean that the automakers will be churning out new cars hand-over-fist, doing so to try and keep up with the demand for an ongoing supply of new driverless cars. On the other hand, some pundits have predicted that we won't need as many cars as we have today, since a smaller number of ridesharing driverless cars will fulfill our driving needs, obviating the need for everyone to own a car. No one knows.

Another facet to consider involves the pace at which high-tech might advance and thus cause a heightened turnover in self-driving cars. Suppose the sensors and computer processors put into a driverless car are eclipsed in just a few years by faster, cheaper, and better ones. If the sensors and processors of a self-driving car are built in, meaning that you can't readily swap them out, the desire to make use of the latest high-tech could be another driving force behind a quicker life cycle for driverless cars.

The idea of retiring a driverless car in four years doesn't seem quite as shocking after analyzing the basis for such a belief.
Whether society is better off or not as a result of self-driving cars, and the matter of those self-driving cars only lasting four years, is a complex question. We'll need to see how this all plays out.

Copyright 2020 Dr. Lance Eliot. This content is originally posted on AI Trends.

[Ed. Note: For readers interested in Dr. Eliot's ongoing business analyses about the advent of self-driving cars, see his online Forbes column: https://forbes.com/sites/lanceeliot/]
Introducing the Open Governance Network Model from Linux.com
Matthew Emerick
15 Oct 2020
12 min read
Background

The Linux Foundation has long served as the home for many of the world's most important open source software projects. We act as the vendor-neutral steward of the collaborative processes that developers engage in to create high-quality and trustworthy code. We also work to build the developer and commercial communities around that code and to support each project's members. We've learned that finding ways for all sorts of companies to benefit from using and contributing back to open source software development is key to a project's sustainability.

Over the last few years, we have also added a series of projects focused on lightweight open standards efforts, recognizing the critical complementary role that standards play in building the open technology landscape. Linux would not have been relevant if not for POSIX, nor would the Apache HTTPD server have mattered were it not for the HTTP specification. And just as with our open source software projects, commercial participants' involvement has been critical to driving adoption and sustainability.

On the horizon, we envision another category of collaboration, one which does not have a well-established term to define it, but which we are today calling "Open Governance Networks." Before describing it, let's talk about an example. Consider ICANN, the agency that arose after demands emerged to evolve the global domain name system (DNS) away from its single-vendor control by Network Solutions. With ICANN, DNS became something more vendor-neutral, international, and accountable to the Internet community. It evolved to develop and manage the "root" of the domain name system, independent from any company or nation.
ICANN's control over the DNS comes primarily through its establishment of an operating agreement among domain name registrars that establishes rules for registrations, guarantees that your domain names are portable, and provides a uniform dispute resolution policy (the UDRP) for times when a domain name conflicts with an established trademark or causes other issues. ICANN is not a standards body; they happily use the DNS standards developed at the IETF. Nor do they create software, other than software incidental to their mission; perhaps they also fund some DNS software development, but that's not their core. ICANN is not where all DNS requests go to get resolved to IP addresses, nor even where everyone goes to register their domain name; that is all pushed to registrars and distributed name servers. In this way, ICANN is not fully decentralized but practices something you might call "minimum viable centralization." Its management of the DNS has not been without critics, but by pushing as much of the hard work to the edge and focusing on being a neutral core, they've helped the DNS and the Internet achieve a degree of consistency, operational success, and trust that would have been hard to imagine building any other way.

There are similar organizations that interface with open standards and software but perform governance functions. A prime example is the CA/Browser Forum, which manages the root certificates for the SSL/TLS web security infrastructure.

Do we need such organizations? Can't we go completely decentralized? While some cryptocurrency networks claim not to need formal human governance, it's clear that there are governance roles performed by individuals and organizations within those communities.
While quite a bit of governance can be automated via smart contracts, tasks such as repairing damage from exploits, promoting the platform's adoption to new users, onboarding new organizations, or coordinating hard-fork upgrades still require humans in the mix. And this is especially important in environments where competitors need to participate in the network to succeed, but do not trust any one competitor to make the decisions.

Network governance is not a solved problem

Network governance is not just an issue for the technical layers. As one moves up the stack into more domain-specific applications, it turns out that there are network governance challenges there as well, which look very familiar. Consider a typical distributed application pattern: supply chain traceability, where participants in the network can view, on a distributed database or ledger, the history of the movement of an object from source to destination, and update the network when they receive or send an object. You might be a raw materials supplier, a manufacturer, a distributor, or a retailer. In any case, you have a vested interest in being able to trust this distributed ledger to be an accurate and faithful representation of the truth. You also want the version you see to be the same ledger everyone else sees, to be able to write to it fairly, and to understand what happens if things go wrong. Achieving all of these desired characteristics requires network governance!

You may be thinking that none of this is strictly needed if only everyone agreed to use one organization's centralized database as the system of record. Perhaps that is a company like eBay, Amazon, Airbnb, or Uber. Or perhaps a non-profit charity or government agency could run this database for us. There are some great examples of shared databases managed by non-profits, such as Wikipedia, run by the Wikimedia Foundation.
This scenario might work for a distributed crowdsourced encyclopedia, but would it work for a supply chain? This participation model requires everyone engaging in the application ecosystem to trust that singular institution to perform a very critical role, and not be hacked, corrupted, or otherwise use that position of power to unfair ends. There is also a need to trust that the entity will not become insolvent or otherwise unable to meet the community's needs. How many Wikipedia entries have been hijacked or subject to "edit wars" that go on forever? Could a company trust such an approach for its supply chain? Probably not.

Over the last ten years, we've seen the development of new tools that allow us to build better distributed data networks without that critical need for a centralized database or institution holding all the keys and trust. Most of these new tools use distributed ledger technology ("DLT", or "blockchain") to build a single source of truth across a network of cooperating peers, and embed programmatic functionality as "smart contracts" or "chaincode" across the network.

The Linux Foundation has been very active in DLT, first with the launch of Hyperledger in December of 2015. The launch of the Trust Over IP Foundation earlier this year focused on the application of self-sovereign identity, in many examples using a DLT as the underlying utility network. As these efforts have focused on software, they have left the development, deployment, and management of these DLT networks to others. Hundreds of such networks built on top of Hyperledger's family of protocol frameworks have launched, some of which (like the Food Trust Network) have grown to hundreds of participating organizations. Many of these networks were never intended to extend beyond an initial set of stakeholders, and they are seeing very successful outcomes. However, many of these networks need a critical mass of industry participants and have faced difficulty achieving that goal.
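As an aside, the tamper-evidence behind the "single source of truth" property mentioned above rests on hash chaining: each ledger entry commits to its predecessor, so altering history is detectable. A toy sketch in Python (purely illustrative; this is not how Hyperledger or any real DLT is implemented):

```python
import hashlib
import json

# Toy hash-chained ledger: each entry stores the hash of the previous entry,
# so any tampering with earlier records breaks verification.
def add_entry(chain, record):
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    body = json.dumps({"record": record, "prev": prev_hash}, sort_keys=True)
    chain.append({
        "record": record,
        "prev": prev_hash,
        "hash": hashlib.sha256(body.encode()).hexdigest(),
    })
    return chain

def verify(chain):
    # Recompute every hash and check each link back to its predecessor.
    for i, entry in enumerate(chain):
        prev_hash = chain[i - 1]["hash"] if i else "0" * 64
        body = json.dumps({"record": entry["record"], "prev": prev_hash},
                          sort_keys=True)
        if (entry["prev"] != prev_hash or
                entry["hash"] != hashlib.sha256(body.encode()).hexdigest()):
            return False
    return True

chain = []
add_entry(chain, {"item": "pallet-1", "from": "supplier", "to": "factory"})
add_entry(chain, {"item": "pallet-1", "from": "factory", "to": "retailer"})
print(verify(chain))                      # True
chain[0]["record"]["to"] = "warehouse"    # tamper with history
print(verify(chain))                      # False
```

A real DLT adds consensus, replication across peers, and smart-contract execution on top of this basic tamper-evidence idea.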
A frequently cited reason is the lack of clear or vendor-neutral governance of the network. No business wants to place its data, or the data it depends upon, in the hands of a competitor; and many are wary even of non-competitors if it locks down competition or creates a dependency on a market participant. For example, what if that company doesn't do well and decides to exit this business segment? At the same time, for most applications you need a large percentage of any given market to make the network worthwhile, so addressing these kinds of business, risk, or political objections to the network structure is just as important as ensuring the software works as advertised.

In many ways, this resembles the evolution of successful open source projects, where developers working at a particular company realize that just posting their source code to a public repository isn't sufficient. Nor even is putting their development processes online and saying "patches welcome." To take an open source project to the point where it becomes the reference solution for the problem being solved and can be trusted for mission-critical purposes, you need to show how its governance and sustainability are not dependent upon a single vendor, corporate largess, or charity. That usually means the project looks for a neutral home at a place like the Linux Foundation, which provides not just that neutrality but also competent stewardship of the community and commercial ecosystem.

Announcing LF Open Governance Networks

To address this need, today we are announcing that the Linux Foundation is adding "Open Governance Networks" to the types of projects we host. We have several such projects in development that will be announced before the end of the year. These projects will operate very similarly to the Linux Foundation's open source software projects, but with some additional key functions.
Their core activities will include:

- Hosting a technical steering committee to specify the software and standards used to build the network, to monitor the network's health, and to coordinate upgrades, configurations, and critical bug fixes
- Hosting a policy and legal committee to specify a network operating agreement that organizations must agree to in order to connect their nodes to the network
- Running a system for identity on the network, so participants can trust that other participants are who they say they are, monitoring the network for health, and taking corrective action if required
- Building out a set of vendors who can be hired to deploy peers-as-a-service on behalf of members, in addition to allowing members' technical staff to run their own if preferred
- Convening a Governing Board composed of sponsoring members who oversee the budget and priorities
- Advocating for the network's adoption by the relevant industry, including engaging relevant regulators and secondary users who don't run their own peers
- Potentially managing an open "app store" approach to offering vetted, reusable, deployable smart contracts and add-on apps for network users

These projects will be sustained through membership dues set by the Governing Board of each project, which will be kept to what's needed for self-sufficiency. Some may also choose to establish transaction fees to compensate operators of peers if usage patterns suggest that would be beneficial. Projects will have complete autonomy regarding technical and software choices; there are no requirements to use other Linux Foundation technologies.

To ensure that these efforts live up to the word "open" and the Linux Foundation's pedigree, the vast majority of technical activity on these projects, and development of all required code and configurations to run the software that is core to the network, will be done publicly.
The source code and documentation will be published under suitable open source licenses, allowing for public engagement in the development process and leading to better long-term trust among participants, better code quality, and more successful outcomes. Hopefully, this will also result in less "bike-shedding" and thrash, better visibility into progress and activity, and an exit strategy should the cooperation efforts hit a snag.

Depending on the industry it serves, the ledger itself might or might not be public. It may contain information only authorized for sharing between the parties involved on the network, or account for GDPR or other regulatory compliance. However, we will certainly encourage long-term approaches that do not treat the ledger data as sensitive. Also, an organization must be a member of the network to run peers on the network; membership may likewise be required to see the ledger, and particularly to write to it or participate in consensus.

Across these Open Governance Network projects, there will be shared operational, project management, marketing, and other logistical support provided by Linux Foundation personnel who will be well-versed in the platform issues and the unique legal and operational issues that arise, no matter which specific technology is chosen.

These networks will create substantial commercial opportunity:

- For software companies building DLT-based applications, this will help you focus on the truly value-delivering apps on top of such a shared network, rather than the mechanics of forming these networks.
- For systems integrators, DLT integration with back-office databases and ERP is expected to grow to billions of dollars in annual activity.
- For end-user organizations, the benefits of automating thankless, non-differentiating, perhaps even regulatorily-required functions could result in huge cost savings and resource optimization.
- For those organizations acting as governing bodies on such networks today, we can help you evolve those projects to reach an even wider audience, while taking off your hands the low-margin, often politically challenging grunt work of managing such networks.
- And for those developers concerned about whether such "private" permissioned networks would lead to dead cul-de-sacs of software, wasted effort, or lost opportunity, having the Linux Foundation's bedrock of open source principles and collaboration techniques behind the development of these networks should help ensure success.

We also recognize that not all networks should be under this model. We expect a diversity of approaches that will be long-term sustainable, and we encourage these networks to find a model that works for them. Let's talk to see if it would be appropriate.

LF Governance Networks will enable our communities to establish their own Open Governance Network and have an entity to process agreements and collect transaction fees. This new entity is a Delaware nonprofit, a nonstock corporation that will maximize utility and not profit. Through agreements with the Linux Foundation, LF Governance Networks will be available to Open Governance Networks hosted at the Linux Foundation.

If you're interested in learning more about hosting an Open Governance Network at the Linux Foundation, please contact us at governancenetworks@linuxfoundation.org.

Thanks! Brian

The post Introducing the Open Governance Network Model appeared first on The Linux Foundation.
How to use Java generics to avoid ClassCastExceptions from InfoWorld Java
Matthew Emerick
15 Oct 2020
1 min read
Java 5 brought generics to the Java language. In this article, I introduce you to generics and discuss generic types, generic methods, generics and type inference, the generics controversy, and generics and heap pollution. (Get the code: download the source code for examples in this Java 101 tutorial, created by Jeff Friesen for JavaWorld.)

What are generics? Generics are a collection of related language features that allow types or methods to operate on objects of various types while providing compile-time type safety. Generics address the problem of java.lang.ClassCastExceptions being thrown at runtime, which result from code that is not type safe (i.e., casting objects from their current types to incompatible types). To read this article in full, please click here.
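As a quick taste of the compile-time safety described above, here is a minimal sketch (the class and method names are made up for illustration) contrasting raw types, which fail only at runtime, with generic types, which catch the same mistake at compile time:

```java
import java.util.ArrayList;
import java.util.List;

public class GenericsDemo {

    // A generic method: the element type T is checked at compile time,
    // so callers never need a cast and cannot trigger a ClassCastException here.
    public static <T> T first(List<T> list) {
        return list.get(0);
    }

    public static void main(String[] args) {
        // Pre-generics style with raw types fails only at runtime:
        //   List names = new ArrayList();
        //   names.add("Alice");
        //   names.add(42);                    // compiles fine
        //   String s = (String) names.get(1); // ClassCastException at runtime
        //
        // With generics, the same mistake is rejected by the compiler:
        List<String> names = new ArrayList<>();
        names.add("Alice");
        // names.add(42);                     // error: incompatible types
        String s = first(names);              // no cast needed
        System.out.println(s);                // Alice
    }
}
```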
MikroORM 4.1: Let's talk about performance from DailyJS - Medium
Matthew Emerick
15 Oct 2020
3 min read
I just shipped version 4.1 of MikroORM, the TypeScript ORM for Node.js, and I feel like this particular release deserves a bit more attention than a regular feature release.

In case you don't know… If you've never heard of MikroORM, it's a TypeScript data-mapper ORM with Unit of Work and Identity Map. It currently supports MongoDB, MySQL, PostgreSQL and SQLite drivers. Key features of the ORM are:

- Implicit transactions
- ChangeSet based persistence
- Identity map

You can read the full introductory article here or browse through the docs.

So what changed? This release had only one clear goal in mind: performance. It all started with an issue pointing out that flushing 10k entities in a single unit of work is very slow. While this kind of use case was never a target for me, I started to see all the possibilities the Unit of Work pattern offers.

Batch inserts, updates and deletes

The biggest performance killer was the number of queries; even if a query is as simple and optimised as possible, firing 10k of them will always be quite slow. For inserts and deletes, it was quite trivial to group all the queries. A bit more challenging were the updates: to batch those, MikroORM now uses case statements. As a result, when you now flush changes made to one entity type, only one query per given operation (create/update/delete) will be executed. This brings a significant difference, as we are now executing a fixed number of queries (in fact the changes are batched in chunks of 300 items).

JIT compilation

The second important change in 4.1 is JIT compilation. Under the hood, MikroORM now first generates simple functions for comparing and hydrating entities that are tailored to their metadata definition. The main difference is that those generated functions access object properties directly (e.g. o.name), instead of dynamically (e.g. o[prop.name]), as all the information from the metadata is inlined there.
This allows V8 to better understand the code, so it is able to run it faster.

Results

Here are the results for a simple 10k-entities benchmark: on average, inserting 10k entities takes around 70ms with SQLite; updates are a tiny bit slower. You can see results for other drivers here: https://github.com/mikro-orm/benchmark.

Acknowledgement

Kudos to Marc J. Schmidt, the author of the initial issue, as without his help this would probably never have happened, or at least not in the near future. Thanks a lot!

Like MikroORM? ⭐️ Star it on GitHub and share this article with your friends. If you want to support the project financially, you can do so via GitHub Sponsors.

MikroORM 4.1: Let's talk about performance was originally published in DailyJS on Medium, where people are continuing the conversation by highlighting and responding to this story.
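Returning to the batching section above: the CASE-statement idea can be sketched in a few lines of TypeScript. This is a simplified illustration of the technique, not MikroORM's actual code; the table and column names are made up, and real implementations use parameterized queries rather than string interpolation:

```typescript
// Sketch: collapse many per-row UPDATEs into one query using CASE statements.
type Row = { id: number; [column: string]: string | number };

function batchUpdateSql(table: string, column: string, rows: Row[]): string {
  // One WHEN branch per row, keyed on the primary key.
  const cases = rows.map((r) => `WHEN ${r.id} THEN '${r[column]}'`).join(' ');
  const ids = rows.map((r) => r.id).join(', ');
  return `UPDATE ${table} SET ${column} = CASE id ${cases} END WHERE id IN (${ids})`;
}

// One query updates all rows instead of one query per row:
const sql = batchUpdateSql('author', 'name', [
  { id: 1, name: 'Alice' },
  { id: 2, name: 'Bob' },
]);
console.log(sql);
// UPDATE author SET name = CASE id WHEN 1 THEN 'Alice' WHEN 2 THEN 'Bob' END WHERE id IN (1, 2)
```

The payoff is that the query count becomes fixed per flushed entity type instead of growing with the number of changed rows, which is exactly why the 10k-entity flush got so much faster.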
Image Processing Techniques That You Can Use in Machine Learning Projects - Planet SciPy
Matthew Emerick
15 Oct 2020
1 min read
Image processing is a method to perform operations on an image to extract information from it or enhance it. Digital image processing... The post Image Processing Techniques That You Can Use in Machine Learning Projects appeared first on neptune.ai.
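As a minimal illustration of the "extract information from it or enhance it" idea, here is a sketch operating on a tiny synthetic NumPy array standing in for a grayscale image (real projects would typically use OpenCV, Pillow, or scikit-image):

```python
import numpy as np

# A synthetic 3x3 grayscale "image": a 2-D array of pixel intensities 0-255.
image = np.array([
    [ 10,  50, 200],
    [ 30, 220, 240],
    [  5,  60, 250],
], dtype=np.uint8)

# Enhancement: a simple contrast stretch to the full 0-255 range.
stretched = ((image - image.min()) / (image.max() - image.min()) * 255)
stretched = stretched.astype(np.uint8)

# Information extraction: threshold to a binary mask of "bright" pixels.
mask = image > 128

print(stretched)
print(int(mask.sum()))  # 4 bright pixels
```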
Has the Time Come for IT Predictive Analytics? from DevOps.com
Matthew Emerick
15 Oct 2020
1 min read
Predictive analytics technologies have become critical to compete in manufacturing (predicting machine failure), banking (predicting fraud), e-commerce (predicting buying behavior) as well as to address horizontal use cases such as cybersecurity breach prevention and sales forecasting. Using data to predict and prevent IT outages and issues is also a growing best practice—especially as advances in […] The post Has the Time Come for IT Predictive Analytics? appeared first on DevOps.com.
Apache Spark on Kubernetes: How Apache YuniKorn (Incubating) helps from Cloudera Blog
Matthew Emerick
14 Oct 2020
10 min read
Background

Why choose K8s for Apache Spark

Apache Spark unifies batch processing, real-time processing, stream analytics, machine learning, and interactive query in one platform. While Apache Spark provides a lot of capabilities to support diversified use cases, it comes with additional complexity and high maintenance costs for cluster administrators. Let's look at some of the high-level requirements for the underlying resource orchestrator to empower Spark as a one-platform:

- Containerized Spark compute to provide shared resources across different ML and ETL jobs
- Support for multiple Spark versions, Python versions, and version-controlled containers on shared K8s clusters, for both faster iteration and stable production
- A single, unified infrastructure for both the majority of batch workloads and microservices
- Fine-grained access controls on shared clusters

Kubernetes, as a de-facto standard for service deployment, offers finer control over all of the above aspects compared to other resource orchestrators. Kubernetes offers a simplified way to manage infrastructure and applications, with a practical approach to isolating workloads, limiting the use of resources, deploying on-demand resources, and auto-scaling as needed.

Scheduling challenges to run Apache Spark on K8s

The Kubernetes default scheduler has gaps in terms of deploying batch workloads efficiently in the same cluster where long-running services are also scheduled. Batch workloads need to be scheduled mostly together and much more frequently, due to the nature of the compute parallelism required. Let's look at some of those gaps in detail.

Lack of first-class application concept

Batch jobs often need to be scheduled in a sequential manner based on types of container deployment. For instance, Spark driver pods need to be scheduled earlier than worker pods. A clear first-class application concept could help with ordering or queuing each container deployment.
Also, such a concept helps admins visualize the jobs that are scheduled, for debugging purposes.

Lack of efficient capacity/quota management capability

Kubernetes namespace resource quotas can be used to manage resources while running a Spark workload in multi-tenant use cases. However, there are a few challenges in achieving this:

- Apache Spark jobs are dynamic in nature with regard to their resource usage. Namespace quotas are fixed and checked during the admission phase. A pod request is rejected if it does not fit into the namespace quota. This requires the Apache Spark job to implement a retry mechanism for pod requests, instead of queueing the request for execution inside Kubernetes itself.
- The namespace resource quota is flat; it doesn't support hierarchical resource quota management. Also, users can often be starved of resources to run batch workloads, as Kubernetes namespace quotas often do not match the organizational-hierarchy-based capacity distribution plan. An elastic and hierarchical priority management for jobs in K8s is missing today.

Lack of resource fairness between tenants

In a production environment, it is often found that the Kubernetes default scheduler cannot efficiently manage diversified workloads and provide resource fairness for their workloads. Some of the key reasons are:

- Batch workload management in a production environment will often involve a large number of users.
- In a dense production environment where different types of workloads are running, it is highly possible that Spark driver pods could occupy all resources in a namespace. Such scenarios pose a big challenge in effective resource sharing.
- Abusive or corrupted jobs could steal resources easily and impact production workloads.

Strict SLA requirements with scheduling latency

Most busy production clusters dedicated to batch workloads often run thousands of jobs with hundreds of thousands of tasks every day.
These workloads require large numbers of parallel container deployments, and the lifetime of such containers is often short (seconds to hours). This typically produces thousands of pods waiting to be scheduled at once; the Kubernetes default scheduler can introduce additional delays here, which can lead to missed SLAs.

How Apache YuniKorn (Incubating) could help

Overview of Apache YuniKorn (Incubating)

YuniKorn is an enhanced Kubernetes scheduler for both services and batch workloads. YuniKorn can replace the Kubernetes default scheduler, or work alongside it, depending on the deployment use case. YuniKorn brings a unified, cross-platform scheduling experience for mixed workloads consisting of stateless batch workloads and stateful services.

YuniKorn vs. the Kubernetes default scheduler: a comparison

The default scheduler lacks, while YuniKorn provides, each of the following:

- Application concept: applications are a first-class citizen in YuniKorn, which schedules apps with respect to, e.g., their submission order, priority, and resource usage.
- Job ordering: YuniKorn supports FIFO/FAIR/Priority (WIP) job ordering policies.
- Fine-grained resource capacity management: cluster resources are managed with hierarchical queues; queues provide guaranteed resources (min) and a resource quota limit (max).
- Resource fairness: fairness across applications and queues yields an ideal allocation for all running applications.
- Native support for Big Data workloads: the default scheduler focuses on long-running services, whereas YuniKorn is designed for Big Data workloads and natively supports running Spark, Flink, TensorFlow, etc. efficiently on K8s.
- Scale and performance: YuniKorn is optimized for performance and is suitable for high-throughput, large-scale environments.

How YuniKorn helps to run Spark on K8s

YuniKorn has a rich set of features that help run Apache Spark efficiently on Kubernetes.
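To ground the discussion, a typical Spark-on-K8s submission looks like the following sketch; the API server address, namespace, and container image name are placeholders you would replace with your own:

```shell
spark-submit \
  --master k8s://https://<k8s-apiserver>:6443 \
  --deploy-mode cluster \
  --name spark-pi \
  --class org.apache.spark.examples.SparkPi \
  --conf spark.executor.instances=5 \
  --conf spark.kubernetes.namespace=spark-test \
  --conf spark.kubernetes.container.image=<registry>/spark:3.0.1 \
  local:///opt/spark/examples/jars/spark-examples_2.12-3.0.1.jar
```

Each such submission creates one driver pod plus a set of executor pods, which is exactly the burst of short-lived pod requests that the sections below are concerned with.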
Detailed steps can be found here to run Spark on K8s with YuniKorn. For more detail on how YuniKorn empowers running Spark on K8s, see the Spark + AI Summit 2020 talk "Cloud-Native Spark Scheduling with YuniKorn Scheduler". Let's take a look at some use cases and how YuniKorn helps to achieve better resource scheduling for Spark in these scenarios.

Multiple (noisy) users running different Spark workloads together

As more users start to run jobs together, it becomes very difficult to isolate them and provide the required resources with resource fairness, priority, and so on. The YuniKorn scheduler provides an optimal solution for managing resource quotas through resource queues. In the above example of a YuniKorn queue structure, namespaces defined in Kubernetes are mapped to queues under the Namespaces parent queue using a placement policy. The Test and Development queues have fixed resource limits; all other queues are limited only by the size of the cluster. Resources are distributed between the queues using a Fair policy, and jobs are scheduled FIFO in the production queue. Some of the high-level advantages are:

- One YuniKorn queue can map automatically to one Kubernetes namespace
- Queue capacity is elastic, ranging from a configured min to a max value
- Resource fairness is honored, avoiding possible resource starvation

YuniKorn provides a seamless way to manage resource quotas for a Kubernetes cluster and can work as a replacement for namespace resource quotas. YuniKorn's resource quota management allows queuing of pod requests and sharing of limited resources between jobs based on pluggable scheduling policies. All of this can be achieved without any extra requirements on Apache Spark, such as retrying pod submits.

Setting up the cluster for an organization-hierarchy-based resource allocation model

In a large production environment, multiple users run various types of workloads together.
Often these users are bound to consume resources according to budget constraints within the organization's team hierarchy. Such a production setup enables efficient cluster resource usage within resource quota boundaries. YuniKorn provides the ability to manage cluster resources with a hierarchy of queues. Fine-grained resource capacity management for a multi-tenant environment becomes possible by using resource queues with a clear hierarchy (such as the organization hierarchy). YuniKorn queues can be statically configured or dynamically managed; with the dynamic queue management feature, users can set up placement rules to delegate queue management.

Better Spark job SLAs in a multi-tenant cluster

Normal ETL workloads running in a multi-tenant cluster require easy means of defining fine-grained policies so that jobs run in the desired organizational queue hierarchy. Such policies often help enforce stricter SLAs for job execution. YuniKorn gives administrators options to enable job ordering in queues based on simple policies such as FIFO and FAIR. The StateAware app sorting policy orders jobs in a queue in FIFO order and schedules them one by one, subject to conditions. This avoids the common race condition seen when submitting lots of batch jobs, e.g. Spark jobs, to a single namespace (or cluster). By enforcing a specific ordering of jobs, it also makes job scheduling more predictable.

Enabling various K8s feature sets for Apache Spark job scheduling

YuniKorn is fully compatible with the major released K8s versions. Users can swap the scheduler transparently on an existing K8s cluster. YuniKorn fully supports all the native K8s semantics that can be used during scheduling, such as label selectors, pod affinity/anti-affinity, taints/tolerations, and PV/PVCs. YuniKorn is also compatible with management commands and utilities, such as cordoning nodes and retrieving events via kubectl.
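As a sketch of how such a hierarchy is declared, YuniKorn reads a queue configuration along these lines; the queue names, placement rule, and resource figures are illustrative assumptions, not a production recommendation:

```yaml
partitions:
  - name: default
    placementrules:
      # map each K8s namespace to a queue under root, creating it on demand
      - name: tag
        value: namespace
        create: true
    queues:
      - name: root
        queues:
          - name: production
            properties:
              # order jobs FIFO and release them one by one to avoid submit races
              application.sort.policy: stateaware
          - name: test
            resources:
              guaranteed:
                memory: 100G
                vcore: 50
              max:
                memory: 200G
                vcore: 100
```

The `guaranteed`/`max` pair is what makes queue capacity elastic: a queue can borrow idle cluster resources up to `max` while always being able to reclaim its `guaranteed` share.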
Apache YuniKorn (Incubating) in CDP

Cloudera's CDP platform offers the Cloudera Data Engineering experience, which is powered by Apache YuniKorn (Incubating). Some of the high-level use cases YuniKorn addresses at Cloudera are:

- Resource quota management for CDE virtual clusters
- Advanced job scheduling capabilities for Spark
- Scheduling of both microservices and batch jobs
- Running in the cloud with auto-scaling enabled

Future roadmap to better support Spark workloads

The YuniKorn community is actively looking into core feature enhancements to support Spark workload execution. Some of the high-level features are:

Gang scheduling: For Spark workloads, it is essential that a minimum number of driver and worker pods be allocated for efficient execution. Gang scheduling helps ensure that the required number of pods is allocated before Spark job execution starts. Such a feature is very helpful in noisy multi-tenant cluster deployments. The YUNIKORN-2 Jira tracks this feature's progress.

Job/task priority support: Job-level priority ordering helps admins prioritize jobs and direct YuniKorn to provision the resources required for high-SLA job execution. It also gives more flexibility for effective use of cluster resources. The YUNIKORN-1 Jira tracks this feature's progress.

Distributed tracing: YUNIKORN-387 leverages OpenTracing to improve the overall observability of the scheduler. With this feature, the critical traces through the core scheduling cycle can be collected and persisted for troubleshooting, system profiling, and monitoring.

Summary

YuniKorn helps achieve fine-grained resource sharing for various Spark workloads efficiently, both in large-scale multi-tenant environments and in dynamically brought-up cloud-native environments.
YuniKorn thus empowers Apache Spark to become an enterprise-grade, essential platform for users, offering robust support for a variety of applications ranging from large-scale data transformation to analytics to machine learning.

Acknowledgments

Thanks to Shaun Ahmadian and Dale Richardson for reviewing and sharing comments. A huge thanks to the YuniKorn open source community members who helped get these features into the latest Apache release.

The post Apache Spark on Kubernetes: How Apache YuniKorn (Incubating) helps appeared first on Cloudera Blog.
Matthew Emerick
14 Oct 2020
6 min read

Using Cloudera Machine Learning to Build a Predictive Maintenance Model for Jet Engines from Cloudera Blog

Introduction

Running a large commercial airline requires the complex management of critical components, including fuel futures contracts, aircraft maintenance, and customer expectations. Airlines in the U.S. alone average about 45,000 daily flights, transporting over 10 million passengers a year (source: FAA). Airlines typically operate on very thin margins, and any schedule delay immediately angers or frustrates customers. Flying is not inherently dangerous, but the consequence of a failure is catastrophic. Airlines maintain a sophisticated business model that encompasses a culture of streamlined supply chains, predictive maintenance, and unwavering customer satisfaction. To maximize safety for all passengers and crew members, while also delivering profits, airlines have heavily invested in predictive analytics to gain insight into the most cost-effective way to maintain real-time engine performance. Additionally, airlines ensure the availability and reliability of their fleets by leveraging maintenance, repair, and overhaul (MRO) organizations such as Lufthansa Technik.

Lufthansa Technik worked with Cloudera to build a predictive maintenance platform to service its fleet of 5,000 aircraft throughout its global network of 800 MRO facilities. Lufthansa Technik extended the standard practice of placing sensors on aircraft engines and enabled predictive maintenance to automate fulfilment solutions. By applying profound airline operations expertise, data science, and engine analytics to a predictive maintenance schedule, Lufthansa Technik can now ensure critical parts are on the ground (OTG) when needed, instead of the entire aircraft being OTG and not producing revenue.

The objective of this blog is to show how to use Cloudera Machine Learning (CML), running on Cloudera Data Platform (CDP), to build a predictive maintenance model based on advanced machine learning concepts.
The Process

Many companies build machine learning models for tasks ranging from perception layers that allow autonomous vehicle operation to modeling a complex jet engine. Kaggle, a site that provides training and test data sets for building machine learning models, offers simulation data sets from NASA that measure engine component degradation for turbofan jet engines. The models in this blog are built on CML and take as input various engine parameters representing typical sensor values for engine temperature, fuel consumption, vibration, and fuel-to-oxygen mixture (see Fig 1). One item to note: in this blog, the term "failure" does not imply catastrophic failure, but rather that one of the engine's components (pumps, valves, etc.) is not operating to specification. Airlines design their aircraft to operate at 99.999% reliability.

Fig 1: Turbofan jet engine

Step 1: Using the training data to create a model/classifier

First, four test and training data sets for varying conditions and failure modes were organized in preparation for CML (see box 1 in Fig 2). Each training data set records the engine parameters per flight, with each engine "flown" until one of its components signals failure, at both sea level and across all flight conditions. This data is used to train the model that predicts how many flights a given engine has left until failure. For each training set, there is a corresponding test data set providing data on 100 jet engines at various stages of life, with actual values against which to test the predictive model for accuracy.

Fig 2: Diagram showing how CML is used to build ML training models

Step 2: Iterate on the model to validate and improve effectiveness

CML was used to create a model that estimates the remaining useful life (RUL) of a given engine using the provided test and training data sets.
A threshold of one week (the time allowance needed to place parts on the ground) was planned for a scenario that alerts an airline before a potential engine component failure. Assuming four flights daily, this means the airline would like to know with confidence whether an engine is going to fail within 40 flights. The model was tested for each engine, and the results were classified as true or false for potential failure within 40 flights (see Table 1).

Table 1: Data in table based on one week of data, i.e. 40 flights.

Step 3: Apply an added cost value to the results

With no preventative maintenance, an engine that runs out of life or fails can compromise safety and cost millions of dollars to replace. If an engine is maintained or overhauled before it runs out of life, the cost of the overhaul is significantly lower. However, if the engine is overhauled too early, potential engine life that could still have been utilized is wasted. The estimated cost in this model for each of these overhaul outcomes can be seen below (see Fig 3).

Fig 3: Cost-benefit confusion matrix

Conclusion

Using Cloudera Machine Learning to analyze the NASA jet engine simulation data provided by Kaggle, our predictive maintenance model predicted with very high accuracy when an engine was likely to fail or require an overhaul. Combining the cost-benefit analysis with this predictive model against the test data sets suggested significant savings across all applied scenarios. Airline decisions are always made with safety as the first consideration and profit second. Predictive maintenance is preferred because it is always the safest choice, and it delivers drastically lower maintenance costs than reactive (engine replacement after failure) or proactive (replacing components before engine replacement) approaches.

Next Steps

To see all of this in action, please follow the links below to a few different sources showcasing the process that was created.
- Video – If you'd like to see and hear how this was built, watch the video at the link.
- Tutorials – If you'd like to work at your own pace, see a detailed walkthrough with screenshots and line-by-line instructions on how to set this up and execute it.
- Meetup – If you want to talk directly with experts from Cloudera, please join a virtual meetup to see a live-stream presentation. There will be time for direct Q&A at the end.
- CDP Users Page – To learn about other CDP resources built for users, including additional videos, tutorials, blogs, and events, click on the link.

The post Using Cloudera Machine Learning to Build a Predictive Maintenance Model for Jet Engines appeared first on Cloudera Blog.
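The cost-weighted evaluation described in Step 3 boils down to scoring the classifier's confusion matrix against per-outcome costs. Here is a minimal Python sketch of that idea; the cost figures and the toy fleet are hypothetical and not taken from the article:

```python
# Hypothetical per-engine costs (USD) for each confusion-matrix outcome.
# These numbers are illustrative only -- substitute your own estimates.
COSTS = {
    "true_positive": 2_000_000,   # overhaul correctly scheduled before failure
    "false_negative": 8_000_000,  # missed failure: full engine replacement
    "false_positive": 3_000_000,  # overhauled too early: useful life wasted
    "true_negative": 0,           # healthy engine left alone
}

def predict_overhaul(rul_flights, threshold=40):
    """Flag an engine for overhaul if its remaining useful life
    (in flights) falls within the one-week alert threshold."""
    return rul_flights <= threshold

def total_cost(engines, threshold=40):
    """Sum the cost of the model's decisions over a fleet.

    `engines` is a list of (predicted_rul, actual_rul) pairs."""
    cost = 0
    for predicted_rul, actual_rul in engines:
        predicted = predict_overhaul(predicted_rul, threshold)
        actual = predict_overhaul(actual_rul, threshold)
        if predicted and actual:
            cost += COSTS["true_positive"]
        elif not predicted and actual:
            cost += COSTS["false_negative"]
        elif predicted and not actual:
            cost += COSTS["false_positive"]
        else:
            cost += COSTS["true_negative"]
    return cost

fleet = [(35, 30), (90, 85), (50, 38), (20, 25)]
print(total_cost(fleet))  # 12000000: two overhauls, one missed failure, one healthy engine
```

Comparing `total_cost` across candidate models (or thresholds) is what turns a plain accuracy comparison into the cost-benefit analysis the article describes.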

Matthew Emerick
14 Oct 2020
8 min read

Election 2020: How data can show what’s driving the Trump vs. Biden polls from What's New

Steve Schwartz, Director, Public Affairs at Tableau
October 14, 2020

It's October, which means there is officially less than one month until the 2020 Presidential election on November 3. Opinion polls on the race between current President Donald Trump and the challenger, former Vice President Joe Biden, are everywhere.

Although many people see public opinion polls as a way to anticipate the outcome of the election, they are most valuable when considered as a snapshot of people's beliefs at a given moment in time. Through our partnership with SurveyMonkey and Axios to collect and share data on the 2020 Presidential race, we've created a dashboard where you can track how survey respondents are feeling about the candidates. But looking at candidate preference data alone doesn't answer the critical question of this year's election: what is driving voter preference?

This year, that's an especially tricky question. There are the major issues confronting the country, from challenges like the COVID-19 pandemic, to the disease's impact on the national and global economies, to the nationwide protests for racial justice and equity. And there's also the news cycle, which seemingly tosses another knuckleball at voters before they've had a chance to process the last one.

By partnering with SurveyMonkey, we've been able to tap into their vast market research technologies to reach the public and visualize their answers to these critical questions. Through our Election 2020 platform, you can dig into this data and expand your understanding of not only what the topline polls are saying, but what is top of mind for the voters making the decision this year. We'll walk you through some of the key data you can find on our Election 2020 pages, and why it's so critical to understanding this year's political landscape.

Preferences by demographics

Understanding the way different demographic groups vote is critical.
It's very common for pollsters to break down results by categories like age bracket, race, and gender. Disaggregating data offers valuable insights into trends among voter groups that can inform understanding of potential election results. But the way the data is often presented, either in static crosstabs or in percentage points scattered throughout an analysis, doesn't really give people insight into voters' intersectionalities and how they play out in the data. SurveyMonkey wanted to give people a way to explore demographic data in a more granular and comprehensive way. They've broken down data on candidate preference by five demographic categories (age, race, gender, education level, and party ID), and in a Tableau dashboard, anyone can choose which categories to combine to see more nuanced voter preferences.

For instance, if one were to look just at gender, the breakdown would be pretty clear: 52% of men support Trump, and 56% of women support Biden. But in this dashboard, you can also choose to layer in race. Suddenly, the picture becomes much more complex: 87% of Black women support Biden, and 76% of Black men support Biden. On the flip side, just 38% of white men support Biden, and 44% of white women support Trump. If you add in another dimension, like education, the numbers become even clearer: by far, the group that most strongly supports Trump (at 70%) is white men with a high school degree or less, and the group with the strongest across-the-board support for Biden is Black women with a postgraduate degree (91%).

"From the perspective of someone who's immersed in crosstabs and bar charts every day, this visualization is the clearest example yet of the value of pairing data collected through SurveyMonkey's mighty scale with visual storytelling tools from Tableau. The fact that it's highly interactive and responsive really brings the data to life in a way that isn't possible using standard tools," Wronski says.
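The layered demographic filtering described above amounts to a grouped tally: filter the respondents, then compute each candidate's share of the subset. Here is a minimal pure-Python sketch; the rows and resulting shares are made up for illustration and are not SurveyMonkey's data:

```python
from collections import Counter

# Toy survey rows: (gender, race, education, candidate) -- illustrative only.
responses = [
    ("woman", "Black", "postgraduate", "Biden"),
    ("woman", "Black", "postgraduate", "Biden"),
    ("man",   "Black", "college",      "Biden"),
    ("man",   "white", "high school",  "Trump"),
    ("man",   "white", "high school",  "Trump"),
    ("man",   "white", "high school",  "Biden"),
    ("woman", "white", "college",      "Trump"),
]

def preference_share(rows, candidate, **filters):
    """Share of respondents matching the demographic filters who prefer
    `candidate`. Filters layer like the dashboard's selections."""
    keys = ("gender", "race", "education")
    subset = [r for r in rows
              if all(r[keys.index(k)] == v for k, v in filters.items())]
    if not subset:
        return None  # no respondents in this slice
    tally = Counter(r[3] for r in subset)
    return tally[candidate] / len(subset)

# Gender alone, then gender layered with race and education:
print(preference_share(responses, "Trump", gender="man"))  # 0.5
print(preference_share(responses, "Trump",
                       gender="man", race="white", education="high school"))  # 2/3
```

Adding a filter narrows the subset, which is why layering dimensions can flip or sharpen a topline number, exactly the effect the dashboard makes visible.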
The COVID-19 pandemic

Let's start with the big one. COVID-19 has posed one of the most significant challenges to the United States and its citizens in recent memory. Over 200,000 people have died, and the economy has recorded its steepest drop on record, with GDP declining more than 9%. As we near the election, the virus is not showing signs of abating (for the latest data on COVID-19, you can visit Tableau's COVID-19 Data Hub). Our partners at SurveyMonkey have been tracking public sentiment around the pandemic since February, as it's impacting the lives of nearly everyone in the United States this year.

"The coronavirus pandemic has infiltrated every aspect of life for the past eight months, and it will continue to do so for the foreseeable future. We wanted to make sure to start measuring concerns early on, and we're committed to tracking public sentiment on this topic for as long as necessary," says Laura Wronski, research science manager at SurveyMonkey.

Through our Election 2020 portal, you can analyze data on how the public is feeling about the pandemic in the lead-up to the election. SurveyMonkey has asked respondents about their personal concerns around the virus: whether they are worried about contracting it themselves, or about someone in their family being affected, and whether they are worried about the pandemic's impact on the economy. Because SurveyMonkey's Tableau dashboards make it easy to filter these responses by a number of demographic factors, from age to political affiliation, you can begin to see patterns in the data and understand how concerns around COVID-19 could be a key factor in shaping the outcome of the election.

Government leadership

Elections are nearly always a referendum on leadership, and this year is no different. However, the pandemic is adding a new layer to how voters assess their elected leaders across the country.
"As the election approaches, politicians who are on the ballot at every level will be judged by how well they responded to the coronavirus this year, both in terms of its effect on the economy through lost jobs and shuttered businesses and in terms of the public health infrastructure's response," Wronski says.

Digging into the data, you can see virtually no difference, along any demographic breakdown, between people's assessment of Trump as a leader overall and people's opinion of how he is handling the federal response to COVID-19. That can tell you several things: that voters' opinions are, at this point, fairly solidified, and also that COVID-19 is a significant driver of those opinions.

Digging into the data on how respondents feel about their state government's response to COVID-19 shows some interesting trends. The clearest split, in many states, seems to be along party lines. In Pennsylvania, for instance, 82% of Democrats approve of the state response, while 71% of Republicans disapprove. In South Carolina, 73% of Republicans approve of the response, and 74% of Democrats disapprove. It gets much more interesting along other demographic lines, though. Here's the opinion split along gender lines in Pennsylvania: 60% of women approve, and 49% of men disapprove. And in South Carolina, 54% of women disapprove, and 54% of men approve.

"Like so much else these days, Republicans and Democrats are split in their views of how worrisome the coronavirus is and how well we've responded to it. Those partisan effects far outweigh any differences by age, gender, race, or other demographic characteristics," Wronski says.

Voting

COVID-19 has complicated nearly every aspect of the 2020 election, including voting itself. Multiple news outlets are reporting a sharp uptick in requests to vote by mail this year, due to concerns about gathering in public amid a pandemic.
But through their data on how likely people are to vote by mail, SurveyMonkey is able to show a clear split along party lines: overall, 70% of Democratic respondents say they're likely to vote by mail, and 72% of Republican respondents say the opposite. Axios, our media partner in the Election 2020 initiative, has analyzed what this means in the context of the potential outcome, and what the implications could be if mail-in ballots are disqualified due to complications with the system.

"More people will vote by mail in this election than in any previous election, and that will reshape the logistics of the electoral tallying process and the entire narrative that we see on the news on Election Day. It's important for us to understand those dynamics early on so that we can help explain those changes to the public," Wronski says.

Exploring with data

Now that you have a sense of the information SurveyMonkey is polling for and why, and how to discover it in Tableau, we hope you take some time to dig into the data and gather your own insights. As the election nears, SurveyMonkey, Tableau, and Axios will continue to deliver more data and analysis around the political landscape, so make sure to keep checking the Election 2020 page for the latest.

Matthew Emerick
14 Oct 2020
1 min read

Tom Swartz: Tuning Your Postgres Database for High Write Loads from Planet PostgreSQL

As a database grows and scales up from a proof of concept to a full-fledged production instance, there are always a variety of growing pains that database administrators and systems administrators will run into. Very often, the engineers on the Crunchy Data support team help support enterprise projects that start out as small proof-of-concept systems and are then promoted to large-scale production use. As these systems receive increased traffic load beyond their original proof-of-concept sizes, one issue may be observed in the Postgres logs:

LOG: checkpoints are occurring too frequently (9 seconds apart)
HINT: Consider increasing the configuration parameter "max_wal_size".
LOG: checkpoints are occurring too frequently (2 seconds apart)
HINT: Consider increasing the configuration parameter "max_wal_size".
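Acting on that hint usually means raising `max_wal_size` and spreading out checkpoint I/O. As a hedged starting point (the exact values must be tuned against your actual write volume and available disk), the relevant postgresql.conf settings look like this:

```
# postgresql.conf -- illustrative values, tune for your workload
max_wal_size = 16GB                  # allow more WAL to accumulate between checkpoints
checkpoint_timeout = 15min           # prefer time-based checkpoint spacing
checkpoint_completion_target = 0.9   # spread checkpoint writes over the interval
log_checkpoints = on                 # verify the new cadence in the logs
```

After a configuration reload, the "checkpoints are occurring too frequently" messages should become far less frequent; if they persist under peak write load, continue increasing `max_wal_size` (at the cost of more disk used for WAL and longer crash recovery).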
Matthew Emerick
14 Oct 2020
2 min read

Why Congress should invest in open-source software (Brookings) from Linux.com

Frank Nagle at Brookings writes: As the pandemic has highlighted, our economy is increasingly reliant on digital infrastructure. As more and more in-person interactions have moved online, products like Zoom have become critical infrastructure supporting business meetings, classroom education, and even congressional hearings. Such communication technologies build on FOSS and rely on the FOSS that is deeply ingrained in the core of the internet. Even grocery shopping, one of the strongholds of brick-and-mortar retail, has seen an increased reliance on digital technology that allows higher-risk shoppers to pay someone to shop for them via apps like InstaCart (which itself relies on, and contributes to, FOSS).

Read more at Brookings

The post Why Congress should invest in open-source software (Brookings) appeared first on Linux.com.

Matthew Emerick
14 Oct 2020
1 min read

Bruce Momjian: Thirty Years of Continuous PostgreSQL Development from Planet PostgreSQL

I did an interview with EDB recently, and a blog post based on that interview was published yesterday. It covers the Postgres 13 feature set and the effects of open source on the software development process.