Amazon MSK now supports the ability to change the size or family of your...
Feed: Recent Announcements. You can now scale your Amazon Managed Streaming for Apache Kafka (MSK) clusters on demand by changing the size or family of your brokers without reassigning Apache Kafka...
View ArticleStoring Multi-Tenant SaaS Data in a Serverless Environment with Amazon Keyspaces
Feed: AWS Partner Network (APN) Blog. Author: Tod Golding. By Tod Golding, Principal Partner Solutions Architect, AWS SaaS FactoryBy Muhammad Sadiqain Tahir, Contributing Writer With the introduction...
View ArticleSnowflake Cloud Data Platform Architecture & Basic Concepts
Feed: Matillion. Author: Julie Polito; As big data continues to get bigger, more organizations are turning to cloud data warehouses. The cloud is the only platform that provides the flexibility and...
View ArticleHow EMX reduced data pipeline costs by 85% with Amazon Athena
Feed: AWS Big Data Blog. This is a guest blog post by Gary Bouton and Louis Ashner from EMX. In their own words, “ENGINE Media Exchange (EMX) is a leading marketing technology company, leveraging a...
View ArticleHow Baqend built a real-time web analytics platform using Amazon Kinesis Data...
Feed: AWS Big Data Blog. This is a customer post written by the engineers from German startup Baqend and the AWS EMEA Prototyping Labs team. Baqend is one of the fastest-growing software as a service...
View ArticleAutomating AWS service logs table creation and querying them with Amazon Athena
Feed: AWS Big Data Blog. I was working with a customer who was just getting started using AWS, and they wanted to understand how to query their AWS service logs that were being delivered to Amazon...
View ArticleAndreas ‘ads’ Scherbaum: Hubert Lubaczewski
Feed: Planet PostgreSQL. Date: 08.02.2021 Tags: postgresql explain perl postgresql psql linux Category: Interviews Interviewed by: Andreas Scherbaum PostgreSQL is the World’s...
View ArticleHalil Ozan Akgul: Citus Tips: How to undistribute a distributed Postgres table
Feed: Planet PostgreSQL. This post by Halil Ozan Akgul about Citus was originally published on the Azure Database for PostgreSQL Blog on Microsoft TechCommunity. Once you start using the Citus...
View ArticleQuerying a Vertica data source in Amazon Athena using the Athena Federated...
Feed: AWS Big Data Blog. The ability to query data and perform ad hoc analysis across multiple platforms and data stores with a single tool brings immense value to the big data analytical arena. As...
View Articleahsan hadi: Exploring SQL command changes in PG-14
Feed: Planet PostgreSQL. I have spent sometime exploring all the SQL command changes in PostgreSQL 14, this blog will provide a brief summary about the SQL changes along with some examples. There are...
View ArticleRyan Lambert: Why Partition OpenStreetMap data?
Feed: Planet PostgreSQL. By Ryan Lambert — Published February 14, 2021 This post covers the first part of my path in considering native Postgres partitioning and how it might be helpful to my work with...
View ArticleSome computational redistricting methods: or, how to sniff out a gerrymander...
Feed: R-bloggers. Author: Jason Timm. [This article was first published on Jason Timm, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share...
View ArticleRyan Lambert: Partition OpenStreetMap data in PostGIS
Feed: Planet PostgreSQL. By Ryan Lambert — Published February 16, 2021 This post continues my quest to explore Postgres native partitioning and determine if it is a good fit for my OpenStreetMap data...
View ArticleAWS Glue Studio jobs can now update AWS Glue Data Catalog tables
Feed: Recent Announcements. AWS Glue Studio now supports updating the AWS Glue Data Catalog during job runs. This feature makes it easy to keep your tables up to date as AWS Glue writes new data into...
View ArticleHans-Juergen Schoenig: PostgreSQL: CREATE STATISTICS – advanced query...
Feed: Planet PostgreSQL. PostgreSQL query optimization with CREATE STATISTICS is an important topic. Usually, the PostgreSQL optimizer (query planner) does an excellent job. This is not only true for...
View ArticleAmazon MSK backup for Archival, Replay, or Analytics
Feed: AWS Architecture Blog. Amazon MSK is a fully managed service that helps you build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for...
View ArticleCreate a custom data connector to Slack’s Member Analytics API in Amazon...
Feed: AWS Big Data Blog. Amazon QuickSight recently added support for Amazon Athena Federated Query, which allows you to query data in place from various data sources. With this capability, QuickSight...
View ArticleRyan Lambert: First Review of Partitioning OpenStreetMap
Feed: Planet PostgreSQL. By Ryan Lambert — Published February 21, 2021 My previous two posts set the stage to evaluate declarative Postgres partitioning for OpenStreetMap data. This post outlines what...
View ArticleAndreas ‘ads’ Scherbaum: Takayuki Tsunakawa
Feed: Planet PostgreSQL. Date: 22.02.2021 Tags: postgresql japan contributor fujitsu video-game virtual-reality visually-impaired Category: Interviews Interviewed by:...
View ArticleRun usage analytics on Amazon QuickSight using AWS CloudTrail
Feed: AWS Big Data Blog. Amazon QuickSight is a cloud-native BI service that allows end users to create and publish dashboards in minutes, without provisioning any servers or requiring complex...
View ArticleEvolving Schemaless into a Distributed SQL Database
Feed: Planet MySQL; Author: Ovais Tariq; In 2016 we published blog posts (I, II) about Schemaless – Uber Engineering’s Scalable Datastore. We went over the design of Schemaless as well as explained the...
View ArticleTable Partitioning In MySQL NDB Cluster and What’s New (Part I)
Feed: Planet MySQL; Author: Saroj Tripathy; This blog is about table partitioning in NDB Cluster. We will see how this feature has been enhanced from version to version. Also we will see which...
View ArticleEffective data lakes using AWS Lake Formation, Part 1: Getting started with...
Feed: AWS Big Data Blog. Thousands of customers are building their data lakes on Amazon Simple Storage Service (Amazon S3). You can use AWS Lake Formation to build your data lakes easily—in a matter of...
View ArticleBuild a data lake using Amazon Kinesis Data Streams for Amazon DynamoDB and...
Feed: AWS Big Data Blog. Amazon DynamoDB helps you capture high-velocity data such as clickstream data to form customized user profiles and online order transaction data to develop customer order...
View ArticleMarco Slot: Citus 10: Columnar for Postgres, rebalancer, single-node, & more
Feed: Planet PostgreSQL. Development on Citus first started around a decade ago and once a year we release a major new Citus open source version. We wanted to make number 10 something special, but I...
View ArticleJeff Davis: Citus 10 brings columnar compression to Postgres
Feed: Planet PostgreSQL. Citus 10 is out! Check out the Citus 10 blog post for all the details. Citus is an open source extension to Postgres (not a fork) that enables scale-out, but offers other great...
View ArticleIngest Salesforce data into Amazon S3 using the CData JDBC custom connector...
Feed: AWS Big Data Blog. Organizations that successfully generate business value from their data will outperform their peers. Many AWS customers require a data storage and analytics solution that...
View ArticleRonDB, automatic thread configuration
Feed: Planet MySQL; Author: Mikael Ronström; This blog introduces how RonDB handles automatic thread configuration. This blog is more technical and dives deeper under the surface of how RonDB operates....
View ArticleArchiving Amazon MSK Data to Amazon S3 with the Lenses.io S3 Kafka Connect...
Feed: AWS Partner Network (APN) Blog. Author: Darragh O’Flanagan. By Darragh O’Flanagan, Sr. Partner Solutions Architect at AWSBy Guillaume Aymé, Director Marketing, at Lenses.io As a stream processing...
View ArticleTable Partitioning In MySQL NDB Cluster And What’s New (Part II)
Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version In this version, users have more flexible ways of table partitioning rather than the default way thru ldm. Now user can...
View ArticleNew features from Apache Hudi available in Amazon EMR
Feed: AWS Big Data Blog. Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development by providing record-level insert, update and...
View ArticleAutomating dynamic mapping and renaming of column names in data files using...
Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...
View ArticleDesigning a Thread Pipeline for optimal database throughput with high IPC and...
Feed: Planet MySQL; Author: Mikael Ronström; There are a couple of questions about the blog post on automatic thread configuration in RonDB. Rather than providing an extensive answer in the comment...
View ArticleTable partitioning in MySQL NDB Cluster and what’s new (Part III)
Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version (Contd.) In cluster 7.5 the READ_BACKUP and FULLY_REPLICATED table features were added. These features are both designed...
View ArticleExtract multidimensional data from Microsoft SQL Server Analysis Services...
Feed: AWS Big Data Blog. AWS Glue is fully managed service that makes it easier for you to extract, transform, and load (ETL) data for analytics. You can easily create ETL jobs to connect to backend...
View ArticleBuild a serverless tracking pixel solution in AWS
Feed: AWS Big Data Blog. Let’s describe the typical use case where a tracking pixel solution, also known as a web beacon, might help you: Analyzing web traffic is critical to understanding user...
View ArticleHans-Juergen Schoenig: How the PostgreSQL query optimizer works
Feed: Planet PostgreSQL. Just like any advanced relational database, PostgreSQL uses a cost-based query optimizer that tries to turn your SQL queries into something efficient that executes in as little...
View ArticleHazelcast IMDG 4.2 GA is Released!
Feed: Blog – Hazelcast. Author: Jiri Holusa. It’s a pleasure to announce the general availability of Hazelcast IMDG 4.2! In this blog post, we highlight the most notable features and enhancements in...
View ArticleTakayuki Tsunakawa: Faster Bulk Insertion to Foreign Tables – Introduction to...
Feed: Planet PostgreSQL. Introduction Currently, the global PostgreSQL development team, led by Amit Kapila, continues to contribute to the community as part of Fujitsu’s work on PostgreSQL. As a...
View ArticleOverview of MySQL Alternative Storage Engines
Feed: Planet MySQL; Author: MySQL Performance Blog; For MySQL, MyISAM and InnoDB storage engines are very popular. Currently, we are mostly using InnoDB engines for high reliability and high...
View ArticleMigrate terabytes of data quickly from Google Cloud to Amazon S3 with AWS...
Feed: AWS Big Data Blog. The cloud is often seen as advantageous for data lakes because of better security, faster time to deployment, better availability, more frequent feature and functionality...
View ArticleSimplify data integration pipeline development using AWS Glue custom blueprints
Feed: AWS Big Data Blog. Organizations spend significant time developing and maintaining data integration pipelines that hydrate data warehouses, data lakes, and lake houses. As data volume increases,...
View ArticleEmil Shkolnik: Is Greenplum Database “just a big sharded PostgreSQL”?
Feed: Planet PostgreSQL. 29 Mar Is Greenplum Database “just a big sharded PostgreSQL”? Post Views: 641 Introduction What is Greenplum Database? This is on of PostgreSQL forks optimized for OLAP and...
View ArticleTable Partitioning In MySQL NDB Cluster And What’s New (Part IV)
Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 8.0 version (8.0.23) With new configuration variables introduced in NDB cluster version 8.0.23, user now have more control in table...
View ArticleAutomate dynamic mapping and renaming of column names in data files using AWS...
Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...
View ArticleHow 1Strategy simplified their spreadsheet ETL process using AWS Glue DataBrew
Feed: AWS Big Data Blog. This is a guest blog post by Pat Reilly and Gary Houk at 1Strategy. In their own words, “1Strategy is an APN Premier Consulting Partner focusing exclusively on AWS solutions....
View ArticleEasily ingest and analyze Google Analytics data with Upsolver and Amazon AppFlow
Feed: AWS Big Data Blog. This post is co-written by Mei Long at Upsolver. Software as a service (SaaS) based applications are in demand today, and customers have growing need for adopting many of them...
View ArticleJourney to Adopt Cloud-Native Architecture Series: #2 – Maximizing System...
Feed: AWS Architecture Blog. In the last blog, Preparing your Applications for Hypergrowth, we talked about hypergrowth and the technical challenges it presents to companies. As a reminder, we...
View ArticleAmazon EC2 now allows you to copy Amazon Machine Images across AWS GovCloud,...
Feed: Recent Announcements. You can now quickly and conveniently copy Amazon Machine Images (AMIs) to AWS GovCloud (US) Regions, AWS China Regions and other AWS regions (also known as partitions) to...
View ArticleProcessing satellite imagery with serverless architecture
Feed: AWS Compute Blog. Author: James Beswick. This post was written by Justin Downes, Machine Learning Consultant. The amount of satellite imagery publicly available is growing and images from...
View Article