partitions – Cloud Data Architect

↧

Amazon MSK now supports the ability to change the size or family of your...

January 21, 2021, 3:07 pm

Feed: Recent Announcements. You can now scale your Amazon Managed Streaming for Apache Kafka (MSK) clusters on demand by changing the size or family of your brokers without reassigning Apache Kafka...

View Article

Image may be NSFW.
Clik here to view.

Storing Multi-Tenant SaaS Data in a Serverless Environment with Amazon Keyspaces

January 25, 2021, 3:52 pm

Feed: AWS Partner Network (APN) Blog. Author: Tod Golding. By Tod Golding, Principal Partner Solutions Architect, AWS SaaS FactoryBy Muhammad Sadiqain Tahir, Contributing Writer With the introduction...

View Article

Image may be NSFW.
Clik here to view.

Snowflake Cloud Data Platform Architecture & Basic Concepts

January 27, 2021, 1:00 am

Feed: Matillion. Author: Julie Polito; As big data continues to get bigger, more organizations are turning to cloud data warehouses. The cloud is the only platform that provides the flexibility and...

View Article

Image may be NSFW.
Clik here to view.

How EMX reduced data pipeline costs by 85% with Amazon Athena

February 2, 2021, 10:29 am

Feed: AWS Big Data Blog. This is a guest blog post by Gary Bouton and Louis Ashner from EMX. In their own words, “ENGINE Media Exchange (EMX) is a leading marketing technology company, leveraging a...

View Article

Image may be NSFW.
Clik here to view.

How Baqend built a real-time web analytics platform using Amazon Kinesis Data...

February 5, 2021, 11:56 am

Feed: AWS Big Data Blog. This is a customer post written by the engineers from German startup Baqend and the AWS EMEA Prototyping Labs team. Baqend is one of the fastest-growing software as a service...

View Article

Image may be NSFW.
Clik here to view.

Automating AWS service logs table creation and querying them with Amazon Athena

February 5, 2021, 12:09 pm

Feed: AWS Big Data Blog. I was working with a customer who was just getting started using AWS, and they wanted to understand how to query their AWS service logs that were being delivered to Amazon...

View Article

Andreas ‘ads’ Scherbaum: Hubert Lubaczewski

February 8, 2021, 6:00 am

Feed: Planet PostgreSQL. Date: 08.02.2021 Tags: postgresql explain perl postgresql psql linux Category: Interviews Interviewed by: Andreas Scherbaum PostgreSQL is the World’s...

View Article

Halil Ozan Akgul: Citus Tips: How to undistribute a distributed Postgres table

February 8, 2021, 11:00 pm

Feed: Planet PostgreSQL. This post by Halil Ozan Akgul about Citus was originally published on the Azure Database for PostgreSQL Blog on Microsoft TechCommunity. Once you start using the Citus...

View Article

Image may be NSFW.
Clik here to view.

Querying a Vertica data source in Amazon Athena using the Athena Federated...

February 11, 2021, 11:43 am

Feed: AWS Big Data Blog. The ability to query data and perform ad hoc analysis across multiple platforms and data stores with a single tool brings immense value to the big data analytical arena. As...

View Article

Image may be NSFW.
Clik here to view.

ahsan hadi: Exploring SQL command changes in PG-14

February 12, 2021, 2:12 am

Feed: Planet PostgreSQL. I have spent sometime exploring all the SQL command changes in PostgreSQL 14, this blog will provide a brief summary about the SQL changes along with some examples. There are...

View Article

Ryan Lambert: Why Partition OpenStreetMap data?

February 13, 2021, 9:01 pm

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 14, 2021 This post covers the first part of my path in considering native Postgres partitioning and how it might be helpful to my work with...

View Article

Image may be NSFW.
Clik here to view.

Some computational redistricting methods: or, how to sniff out a gerrymander...

February 15, 2021, 11:11 pm

Feed: R-bloggers. Author: Jason Timm. [This article was first published on Jason Timm, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share...

View Article

Ryan Lambert: Partition OpenStreetMap data in PostGIS

February 15, 2021, 9:01 pm

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 16, 2021 This post continues my quest to explore Postgres native partitioning and determine if it is a good fit for my OpenStreetMap data...

View Article

AWS Glue Studio jobs can now update AWS Glue Data Catalog tables

February 17, 2021, 4:03 pm

Feed: Recent Announcements. AWS Glue Studio now supports updating the AWS Glue Data Catalog during job runs. This feature makes it easy to keep your tables up to date as AWS Glue writes new data into...

View Article

Hans-Juergen Schoenig: PostgreSQL: CREATE STATISTICS – advanced query...

February 18, 2021, 5:00 am

Feed: Planet PostgreSQL. PostgreSQL query optimization with CREATE STATISTICS is an important topic. Usually, the PostgreSQL optimizer (query planner) does an excellent job. This is not only true for...

View Article

Image may be NSFW.
Clik here to view.

Amazon MSK backup for Archival, Replay, or Analytics

February 19, 2021, 10:03 am

Feed: AWS Architecture Blog. Amazon MSK is a fully managed service that helps you build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for...

View Article

Image may be NSFW.
Clik here to view.

Create a custom data connector to Slack’s Member Analytics API in Amazon...

February 19, 2021, 5:54 pm

Feed: AWS Big Data Blog. Amazon QuickSight recently added support for Amazon Athena Federated Query, which allows you to query data in place from various data sources. With this capability, QuickSight...

View Article

Ryan Lambert: First Review of Partitioning OpenStreetMap

February 20, 2021, 9:01 pm

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 21, 2021 My previous two posts set the stage to evaluate declarative Postgres partitioning for OpenStreetMap data. This post outlines what...

View Article

Andreas ‘ads’ Scherbaum: Takayuki Tsunakawa

February 22, 2021, 6:00 am

Feed: Planet PostgreSQL. Date: 22.02.2021 Tags: postgresql japan contributor fujitsu video-game virtual-reality visually-impaired Category: Interviews Interviewed by:...

View Article

Image may be NSFW.
Clik here to view.

Run usage analytics on Amazon QuickSight using AWS CloudTrail

February 24, 2021, 1:44 pm

Feed: AWS Big Data Blog. Amazon QuickSight is a cloud-native BI service that allows end users to create and publish dashboards in minutes, without provisioning any servers or requiring complex...

View Article

Image may be NSFW.
Clik here to view.

Evolving Schemaless into a Distributed SQL Database

February 26, 2021, 11:15 pm

Feed: Planet MySQL; Author: Ovais Tariq; In 2016 we published blog posts (I, II) about Schemaless – Uber Engineering’s Scalable Datastore. We went over the design of Schemaless as well as explained the...

View Article

Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster and What’s New (Part I)

March 1, 2021, 2:12 am

Feed: Planet MySQL; Author: Saroj Tripathy; This blog is about table partitioning in NDB Cluster. We will see how this feature has been enhanced from version to version. Also we will see which...

View Article

Image may be NSFW.
Clik here to view.

Effective data lakes using AWS Lake Formation, Part 1: Getting started with...

March 3, 2021, 9:07 am

Feed: AWS Big Data Blog. Thousands of customers are building their data lakes on Amazon Simple Storage Service (Amazon S3). You can use AWS Lake Formation to build your data lakes easily—in a matter of...

View Article

Image may be NSFW.
Clik here to view.

Build a data lake using Amazon Kinesis Data Streams for Amazon DynamoDB and...

March 4, 2021, 1:15 pm

Feed: AWS Big Data Blog. Amazon DynamoDB helps you capture high-velocity data such as clickstream data to form customized user profiles and online order transaction data to develop customer order...

View Article

Marco Slot: Citus 10: Columnar for Postgres, rebalancer, single-node, & more

March 5, 2021, 9:00 am

Feed: Planet PostgreSQL. Development on Citus first started around a decade ago and once a year we release a major new Citus open source version. We wanted to make number 10 something special, but I...

View Article

Jeff Davis: Citus 10 brings columnar compression to Postgres

March 6, 2021, 9:00 am

Feed: Planet PostgreSQL. Citus 10 is out! Check out the Citus 10 blog post for all the details. Citus is an open source extension to Postgres (not a fork) that enables scale-out, but offers other great...

View Article

Image may be NSFW.
Clik here to view.

Ingest Salesforce data into Amazon S3 using the CData JDBC custom connector...

March 9, 2021, 9:12 am

Feed: AWS Big Data Blog. Organizations that successfully generate business value from their data will outperform their peers. Many AWS customers require a data storage and analytics solution that...

View Article

Image may be NSFW.
Clik here to view.

RonDB, automatic thread configuration

March 10, 2021, 10:05 am

Feed: Planet MySQL; Author: Mikael Ronström; This blog introduces how RonDB handles automatic thread configuration. This blog is more technical and dives deeper under the surface of how RonDB operates....

View Article

Image may be NSFW.
Clik here to view.

Archiving Amazon MSK Data to Amazon S3 with the Lenses.io S3 Kafka Connect...

March 10, 2021, 12:44 pm

Feed: AWS Partner Network (APN) Blog. Author: Darragh O’Flanagan. By Darragh O’Flanagan, Sr. Partner Solutions Architect at AWSBy Guillaume Aymé, Director Marketing, at Lenses.io As a stream processing...

View Article

Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster And What’s New (Part II)

March 10, 2021, 9:42 pm

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version In this version, users have more flexible ways of table partitioning rather than the default way thru ldm. Now user can...

View Article

Image may be NSFW.
Clik here to view.

New features from Apache Hudi available in Amazon EMR

March 11, 2021, 1:22 pm

Feed: AWS Big Data Blog. Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development by providing record-level insert, update and...

View Article

Image may be NSFW.
Clik here to view.

Automating dynamic mapping and renaming of column names in data files using...

March 12, 2021, 11:03 am

Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...

View Article

Image may be NSFW.
Clik here to view.

Designing a Thread Pipeline for optimal database throughput with high IPC and...

March 15, 2021, 5:00 pm

Feed: Planet MySQL; Author: Mikael Ronström; There are a couple of questions about the blog post on automatic thread configuration in RonDB. Rather than providing an extensive answer in the comment...

View Article

Image may be NSFW.
Clik here to view.

Table partitioning in MySQL NDB Cluster and what’s new (Part III)

March 19, 2021, 12:12 pm

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version (Contd.) In cluster 7.5 the READ_BACKUP and FULLY_REPLICATED table features were added. These features are both designed...

View Article

Image may be NSFW.
Clik here to view.

Extract multidimensional data from Microsoft SQL Server Analysis Services...

March 22, 2021, 11:38 am

Feed: AWS Big Data Blog. AWS Glue is fully managed service that makes it easier for you to extract, transform, and load (ETL) data for analytics. You can easily create ETL jobs to connect to backend...

View Article

Image may be NSFW.
Clik here to view.

Build a serverless tracking pixel solution in AWS

March 23, 2021, 9:39 am

Feed: AWS Big Data Blog. Let’s describe the typical use case where a tracking pixel solution, also known as a web beacon, might help you: Analyzing web traffic is critical to understanding user...

View Article

Hans-Juergen Schoenig: How the PostgreSQL query optimizer works

March 24, 2021, 1:30 am

Feed: Planet PostgreSQL. Just like any advanced relational database, PostgreSQL uses a cost-based query optimizer that tries to turn your SQL queries into something efficient that executes in as little...

View Article

Hazelcast IMDG 4.2 GA is Released!

March 24, 2021, 2:00 am

Feed: Blog – Hazelcast. Author: Jiri Holusa. It’s a pleasure to announce the general availability of Hazelcast IMDG 4.2! In this blog post, we highlight the most notable features and enhancements in...

View Article

Takayuki Tsunakawa: Faster Bulk Insertion to Foreign Tables – Introduction to...

March 26, 2021, 7:45 am

Feed: Planet PostgreSQL. Introduction Currently, the global PostgreSQL development team, led by Amit Kapila, continues to contribute to the community as part of Fujitsu’s work on PostgreSQL. As a...

View Article

Image may be NSFW.
Clik here to view.

Overview of MySQL Alternative Storage Engines

March 26, 2021, 9:40 am

Feed: Planet MySQL; Author: MySQL Performance Blog; For MySQL, MyISAM and InnoDB storage engines are very popular. Currently, we are mostly using InnoDB engines for high reliability and high...

View Article

Image may be NSFW.
Clik here to view.

Migrate terabytes of data quickly from Google Cloud to Amazon S3 with AWS...

March 30, 2021, 12:20 pm

Feed: AWS Big Data Blog. The cloud is often seen as advantageous for data lakes because of better security, faster time to deployment, better availability, more frequent feature and functionality...

View Article

Image may be NSFW.
Clik here to view.

Simplify data integration pipeline development using AWS Glue custom blueprints

March 30, 2021, 4:28 pm

Feed: AWS Big Data Blog. Organizations spend significant time developing and maintaining data integration pipelines that hydrate data warehouses, data lakes, and lake houses. As data volume increases,...

View Article

Image may be NSFW.
Clik here to view.

Emil Shkolnik: Is Greenplum Database “just a big sharded PostgreSQL”?

March 31, 2021, 11:02 pm

Feed: Planet PostgreSQL. 29 Mar Is Greenplum Database “just a big sharded PostgreSQL”? Post Views: 641 Introduction What is Greenplum Database? This is on of PostgreSQL forks optimized for OLAP and...

View Article

Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster And What’s New (Part IV)

April 1, 2021, 12:46 am

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 8.0 version (8.0.23) With new configuration variables introduced in NDB cluster version 8.0.23, user now have more control in table...

View Article

Image may be NSFW.
Clik here to view.

Automate dynamic mapping and renaming of column names in data files using AWS...

April 1, 2021, 2:51 am

Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...

View Article

Image may be NSFW.
Clik here to view.

How 1Strategy simplified their spreadsheet ETL process using AWS Glue DataBrew

April 1, 2021, 2:51 am

Feed: AWS Big Data Blog. This is a guest blog post by Pat Reilly and Gary Houk at 1Strategy. In their own words, “1Strategy is an APN Premier Consulting Partner focusing exclusively on AWS solutions....

View Article

Image may be NSFW.
Clik here to view.

Easily ingest and analyze Google Analytics data with Upsolver and Amazon AppFlow

April 1, 2021, 2:52 am

Feed: AWS Big Data Blog. This post is co-written by Mei Long at Upsolver. Software as a service (SaaS) based applications are in demand today, and customers have growing need for adopting many of them...

View Article

Image may be NSFW.
Clik here to view.

Journey to Adopt Cloud-Native Architecture Series: #2 – Maximizing System...

April 1, 2021, 3:41 am

Feed: AWS Architecture Blog. In the last blog, Preparing your Applications for Hypergrowth, we talked about hypergrowth and the technical challenges it presents to companies. As a reminder, we...

View Article

Amazon EC2 now allows you to copy Amazon Machine Images across AWS GovCloud,...

April 6, 2021, 2:04 pm

Feed: Recent Announcements. You can now quickly and conveniently copy Amazon Machine Images (AMIs) to AWS GovCloud (US) Regions, AWS China Regions and other AWS regions (also known as partitions) to...

View Article

Image may be NSFW.
Clik here to view.

Processing satellite imagery with serverless architecture

April 7, 2021, 8:02 am

Feed: AWS Compute Blog. Author: James Beswick. This post was written by Justin Downes, Machine Learning Consultant. The amount of satellite imagery publicly available is growing and images from...

View Article