Quantcast
Channel: partitions – Cloud Data Architect
Browsing latest articles
Browse All 413 View Live

Amazon MSK now supports the ability to change the size or family of your...

Feed: Recent Announcements. You can now scale your Amazon Managed Streaming for Apache Kafka (MSK) clusters on demand by changing the size or family of your brokers without reassigning Apache Kafka...

View Article


Image may be NSFW.
Clik here to view.

Storing Multi-Tenant SaaS Data in a Serverless Environment with Amazon Keyspaces

Feed: AWS Partner Network (APN) Blog. Author: Tod Golding. By Tod Golding, Principal Partner Solutions Architect, AWS SaaS FactoryBy Muhammad Sadiqain Tahir, Contributing Writer With the introduction...

View Article


Image may be NSFW.
Clik here to view.

Snowflake Cloud Data Platform Architecture & Basic Concepts

Feed: Matillion. Author: Julie Polito;   As big data continues to get bigger, more organizations are turning to cloud data warehouses. The cloud is the only platform that provides the flexibility and...

View Article

Image may be NSFW.
Clik here to view.

How EMX reduced data pipeline costs by 85% with Amazon Athena

Feed: AWS Big Data Blog. This is a guest blog post by Gary Bouton and Louis Ashner from EMX. In their own words, “ENGINE Media Exchange (EMX) is a leading marketing technology company, leveraging a...

View Article

Image may be NSFW.
Clik here to view.

How Baqend built a real-time web analytics platform using Amazon Kinesis Data...

Feed: AWS Big Data Blog. This is a customer post written by the engineers from German startup Baqend and the AWS EMEA Prototyping Labs team. Baqend is one of the fastest-growing software as a service...

View Article


Image may be NSFW.
Clik here to view.

Automating AWS service logs table creation and querying them with Amazon Athena

Feed: AWS Big Data Blog. I was working with a customer who was just getting started using AWS, and they wanted to understand how to query their AWS service logs that were being delivered to Amazon...

View Article

Andreas ‘ads’ Scherbaum: Hubert Lubaczewski

Feed: Planet PostgreSQL. Date: 08.02.2021 Tags:   postgresql    explain    perl    postgresql    psql    linux    Category:   Interviews    Interviewed by: Andreas Scherbaum PostgreSQL is the World’s...

View Article

Halil Ozan Akgul: Citus Tips: How to undistribute a distributed Postgres table

Feed: Planet PostgreSQL. This post by Halil Ozan Akgul about Citus was originally published on the Azure Database for PostgreSQL Blog on Microsoft TechCommunity. Once you start using the Citus...

View Article


Image may be NSFW.
Clik here to view.

Querying a Vertica data source in Amazon Athena using the Athena Federated...

Feed: AWS Big Data Blog. The ability to query data and perform ad hoc analysis across multiple platforms and data stores with a single tool brings immense value to the big data analytical arena. As...

View Article


Image may be NSFW.
Clik here to view.

ahsan hadi: Exploring SQL command changes in PG-14

Feed: Planet PostgreSQL. I have spent sometime exploring all the SQL command changes in PostgreSQL 14, this blog will provide a brief summary about the SQL changes along with some examples. There are...

View Article

Ryan Lambert: Why Partition OpenStreetMap data?

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 14, 2021 This post covers the first part of my path in considering native Postgres partitioning and how it might be helpful to my work with...

View Article

Image may be NSFW.
Clik here to view.

Some computational redistricting methods: or, how to sniff out a gerrymander...

Feed: R-bloggers. Author: Jason Timm. [This article was first published on Jason Timm, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share...

View Article

Ryan Lambert: Partition OpenStreetMap data in PostGIS

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 16, 2021 This post continues my quest to explore Postgres native partitioning and determine if it is a good fit for my OpenStreetMap data...

View Article


AWS Glue Studio jobs can now update AWS Glue Data Catalog tables

Feed: Recent Announcements. AWS Glue Studio now supports updating the AWS Glue Data Catalog during job runs. This feature makes it easy to keep your tables up to date as AWS Glue writes new data into...

View Article

Hans-Juergen Schoenig: PostgreSQL: CREATE STATISTICS – advanced query...

Feed: Planet PostgreSQL. PostgreSQL query optimization with CREATE STATISTICS is an important topic. Usually, the PostgreSQL optimizer (query planner) does an excellent job. This is not only true for...

View Article


Image may be NSFW.
Clik here to view.

Amazon MSK backup for Archival, Replay, or Analytics

Feed: AWS Architecture Blog. Amazon MSK is a fully managed service that helps you build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for...

View Article

Image may be NSFW.
Clik here to view.

Create a custom data connector to Slack’s Member Analytics API in Amazon...

Feed: AWS Big Data Blog. Amazon QuickSight recently added support for Amazon Athena Federated Query, which allows you to query data in place from various data sources. With this capability, QuickSight...

View Article


Ryan Lambert: First Review of Partitioning OpenStreetMap

Feed: Planet PostgreSQL. By Ryan Lambert — Published February 21, 2021 My previous two posts set the stage to evaluate declarative Postgres partitioning for OpenStreetMap data. This post outlines what...

View Article

Andreas ‘ads’ Scherbaum: Takayuki Tsunakawa

Feed: Planet PostgreSQL. Date: 22.02.2021 Tags:   postgresql    japan    contributor    fujitsu    video-game    virtual-reality    visually-impaired    Category:   Interviews    Interviewed by:...

View Article

Image may be NSFW.
Clik here to view.

Run usage analytics on Amazon QuickSight using AWS CloudTrail

Feed: AWS Big Data Blog. Amazon QuickSight is a cloud-native BI service that allows end users to create and publish dashboards in minutes, without provisioning any servers or requiring complex...

View Article

Image may be NSFW.
Clik here to view.

Evolving Schemaless into a Distributed SQL Database

Feed: Planet MySQL; Author: Ovais Tariq; In 2016 we published blog posts (I, II) about Schemaless – Uber Engineering’s Scalable Datastore. We went over the design of Schemaless as well as explained the...

View Article


Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster and What’s New (Part I)

Feed: Planet MySQL; Author: Saroj Tripathy; This blog is about table partitioning in NDB Cluster. We will see how this feature has been enhanced from version to version. Also we will see which...

View Article


Image may be NSFW.
Clik here to view.

Effective data lakes using AWS Lake Formation, Part 1: Getting started with...

Feed: AWS Big Data Blog. Thousands of customers are building their data lakes on Amazon Simple Storage Service (Amazon S3). You can use AWS Lake Formation to build your data lakes easily—in a matter of...

View Article

Image may be NSFW.
Clik here to view.

Build a data lake using Amazon Kinesis Data Streams for Amazon DynamoDB and...

Feed: AWS Big Data Blog. Amazon DynamoDB helps you capture high-velocity data such as clickstream data to form customized user profiles and online order transaction data to develop customer order...

View Article

Marco Slot: Citus 10: Columnar for Postgres, rebalancer, single-node, & more

Feed: Planet PostgreSQL. Development on Citus first started around a decade ago and once a year we release a major new Citus open source version. We wanted to make number 10 something special, but I...

View Article


Jeff Davis: Citus 10 brings columnar compression to Postgres

Feed: Planet PostgreSQL. Citus 10 is out! Check out the Citus 10 blog post for all the details. Citus is an open source extension to Postgres (not a fork) that enables scale-out, but offers other great...

View Article

Image may be NSFW.
Clik here to view.

Ingest Salesforce data into Amazon S3 using the CData JDBC custom connector...

Feed: AWS Big Data Blog. Organizations that successfully generate business value from their data will outperform their peers. Many AWS customers require a data storage and analytics solution that...

View Article

Image may be NSFW.
Clik here to view.

RonDB, automatic thread configuration

Feed: Planet MySQL; Author: Mikael Ronström; This blog introduces how RonDB handles automatic thread configuration. This blog is more technical and dives deeper under the surface of how RonDB operates....

View Article

Image may be NSFW.
Clik here to view.

Archiving Amazon MSK Data to Amazon S3 with the Lenses.io S3 Kafka Connect...

Feed: AWS Partner Network (APN) Blog. Author: Darragh O’Flanagan. By Darragh O’Flanagan, Sr. Partner Solutions Architect at AWSBy Guillaume Aymé, Director Marketing, at Lenses.io As a stream processing...

View Article



Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster And What’s New (Part II)

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version In this version, users have more flexible ways of table partitioning rather than the default way thru ldm. Now user can...

View Article

Image may be NSFW.
Clik here to view.

New features from Apache Hudi available in Amazon EMR

Feed: AWS Big Data Blog. Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development by providing record-level insert, update and...

View Article

Image may be NSFW.
Clik here to view.

Automating dynamic mapping and renaming of column names in data files using...

Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...

View Article

Image may be NSFW.
Clik here to view.

Designing a Thread Pipeline for optimal database throughput with high IPC and...

Feed: Planet MySQL; Author: Mikael Ronström;  There are a couple of questions about the blog post on automatic thread configuration in RonDB. Rather than providing an extensive answer in the comment...

View Article


Image may be NSFW.
Clik here to view.

Table partitioning in MySQL NDB Cluster and what’s new (Part III)

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 7.5 version (Contd.) In cluster 7.5 the READ_BACKUP and FULLY_REPLICATED table features were added. These features are both designed...

View Article

Image may be NSFW.
Clik here to view.

Extract multidimensional data from Microsoft SQL Server Analysis Services...

Feed: AWS Big Data Blog. AWS Glue is fully managed service that makes it easier for you to extract, transform, and load (ETL) data for analytics. You can easily create ETL jobs to connect to backend...

View Article

Image may be NSFW.
Clik here to view.

Build a serverless tracking pixel solution in AWS

Feed: AWS Big Data Blog. Let’s describe the typical use case where a tracking pixel solution, also known as a web beacon, might help you: Analyzing web traffic is critical to understanding user...

View Article


Hans-Juergen Schoenig: How the PostgreSQL query optimizer works

Feed: Planet PostgreSQL. Just like any advanced relational database, PostgreSQL uses a cost-based query optimizer that tries to turn your SQL queries into something efficient that executes in as little...

View Article


Hazelcast IMDG 4.2 GA is Released!

Feed: Blog – Hazelcast. Author: Jiri Holusa. It’s a pleasure to announce the general availability of Hazelcast IMDG 4.2! In this blog post, we highlight the most notable features and enhancements in...

View Article

Takayuki Tsunakawa: Faster Bulk Insertion to Foreign Tables – Introduction to...

Feed: Planet PostgreSQL. Introduction Currently, the global PostgreSQL development team, led by Amit Kapila, continues to contribute to the community as part of Fujitsu’s work on PostgreSQL. As a...

View Article

Image may be NSFW.
Clik here to view.

Overview of MySQL Alternative Storage Engines

Feed: Planet MySQL; Author: MySQL Performance Blog; For MySQL, MyISAM and InnoDB storage engines are very popular. Currently, we are mostly using InnoDB engines for high reliability and high...

View Article

Image may be NSFW.
Clik here to view.

Migrate terabytes of data quickly from Google Cloud to Amazon S3 with AWS...

Feed: AWS Big Data Blog. The cloud is often seen as advantageous for data lakes because of better security, faster time to deployment, better availability, more frequent feature and functionality...

View Article


Image may be NSFW.
Clik here to view.

Simplify data integration pipeline development using AWS Glue custom blueprints

Feed: AWS Big Data Blog. Organizations spend significant time developing and maintaining data integration pipelines that hydrate data warehouses, data lakes, and lake houses. As data volume increases,...

View Article

Image may be NSFW.
Clik here to view.

Emil Shkolnik: Is Greenplum Database “just a big sharded PostgreSQL”?

Feed: Planet PostgreSQL. 29 Mar Is Greenplum Database “just a big sharded PostgreSQL”? Post Views: 641 Introduction What is Greenplum Database? This is on of PostgreSQL forks optimized for OLAP and...

View Article


Image may be NSFW.
Clik here to view.

Table Partitioning In MySQL NDB Cluster And What’s New (Part IV)

Feed: Planet MySQL; Author: Saroj Tripathy; Whats new in NDB Cluster 8.0 version (8.0.23) With new configuration variables introduced in NDB cluster version 8.0.23, user now have more control in table...

View Article

Image may be NSFW.
Clik here to view.

Automate dynamic mapping and renaming of column names in data files using AWS...

Feed: AWS Big Data Blog. In Part 1 of this two-part post, we looked at how we can create an AWS Glue ETL job that is agnostic enough to rename columns of a data file by mapping to column names of...

View Article


Image may be NSFW.
Clik here to view.

How 1Strategy simplified their spreadsheet ETL process using AWS Glue DataBrew

Feed: AWS Big Data Blog. This is a guest blog post by Pat Reilly and Gary Houk at 1Strategy. In their own words, “1Strategy is an APN Premier Consulting Partner focusing exclusively on AWS solutions....

View Article

Image may be NSFW.
Clik here to view.

Easily ingest and analyze Google Analytics data with Upsolver and Amazon AppFlow

Feed: AWS Big Data Blog. This post is co-written by Mei Long at Upsolver.  Software as a service (SaaS) based applications are in demand today, and customers have growing need for adopting many of them...

View Article

Image may be NSFW.
Clik here to view.

Journey to Adopt Cloud-Native Architecture Series: #2 – Maximizing System...

Feed: AWS Architecture Blog. In the last blog, Preparing your Applications for Hypergrowth, we talked about hypergrowth and the technical challenges it presents to companies. As a reminder, we...

View Article

Amazon EC2 now allows you to copy Amazon Machine Images across AWS GovCloud,...

Feed: Recent Announcements. You can now quickly and conveniently copy Amazon Machine Images (AMIs) to AWS GovCloud (US) Regions, AWS China Regions and other AWS regions (also known as partitions) to...

View Article


Image may be NSFW.
Clik here to view.

Processing satellite imagery with serverless architecture

Feed: AWS Compute Blog. Author: James Beswick. This post was written by Justin Downes, Machine Learning Consultant. The amount of satellite imagery publicly available is growing and images from...

View Article

Browsing latest articles
Browse All 413 View Live