Learning Google BigQuery

Author: Eric Brown   Thirukkumaran Haridass   Jason Morris   Mikhail Berlyant   Ruben Oliva Ramos  

Publisher: Packt Publishing‎

Publication year: 2017

E-ISBN: 9781787286290

P-ISBN(Paperback): 89543100306050

Subject: TN919.5 数据处理系统及设备;TP274 数据处理、数据处理系统

Language: ENG

Access to resources Favorite

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Learning Google BigQuery

Chapter

Chapter 1: Google Cloud and Google BigQuery

Getting started with Google Cloud

Overviewing Google Cloud Platform services

Google Cloud storage and its features

Learning Google BigQuery

Working with the browser

Running your first query

BigQuery public datasets

Getting started with Cloud SQL 

Cloud Datastore

Google App engine

App engine standard environment

App engine flexible environment

Google container engine

Google compute engine

Summary

Chapter 2: Google Cloud SDK

Installing Google Cloud SDK

Installing Google Cloud SDK on Windows

Installing Google Cloud SDK on macOS

Installing Google Cloud SDK on Linux

gsutil for Google Cloud Storage

Using the bq utility for BigQuery

Using the gcloud utility

Connecting to Cloud SQL using gcloud

Authorizing the client machine via Google Cloud Console

Connecting using a proxy script

Exporting Cloud SQL databases and tables

Deploying to Google App Engine

Summary

Chapter 3: Google BigQuery Data Types

Supported data types

Data type considerations

Converting data

Sanitizing data

When to transform your data? Before or after loading to BigQuery?

Arithmetic Operators

Comparison Operators

Date Time Functions

String Functions

Regular Expression Functions

Functions for transformation

Mastering transformation with User-Defined Functions

Some considerations when using UDFs

UDF format

Summary

Further Reading

Chapter 4: BigQuery SQL Basic

The BigQuery interface

Error checking

Querying in BigQuery

Types of queries

Querying public data

Basic SQL syntax

Commenting in BigQuery SQL

SELECT

FROM 

WHERE 

GROUP BY 

ORDER BY

HAVING 

Qualifying tables in query

DISTINCT

BigQuery SQL functions

WITHIN

OMIT RECORD IF

ROLLUP

Joining tables in BigQuery

Inner join

Left Outer join

Right Outer join

Full Outer join

Cross join

UNION,  UNION ALL, and UNION DISTINCT

Adding your own data in BigQuery

Creating a table

Inserting data to a table

Updating data in a table

Resetting a value

Deleting data from a table

Summary

Further reading

Chapter 5: BigQuery SQL Advanced

Partition tables

Creating a partition table using a GUI

Creating a partition table using Google Cloud SDK

Querying data in a partition table

Using partition tables in your projects

Querying external data sources using BigQuery

Creating the table definition

Querying data from external data sources

Wildcard tables

User-defined functions

Views

Querying nested and repeated records

Summary

Further reading

Chapter 6: Google BigQuery API

Accessing Google BigQuery

Introducing Google APIs explorer

Getting credentials for API access

Creating a service account

Programming with BigQuery API in C# .NET

Authenticating the service account

Listing all datasets and all tables in the project

Creating a new dataset in the project

Creating a new table within a dataset

Loading data from a file in Google Cloud Storage to a BigQuery table

Executing a query and displaying the result

Executing the query and saving the result in a new table

Streaming insert of rows

Programming with BigQuery API in Python

Listing all datasets and all tables in the project

Creating a new dataset in the project

Creating a new table within a dataset

Importing data from a file in Google Cloud Storage to a BigQuery table

Executing a query and displaying the result

Execute query and copy results to a new table

Streaming insert of rows

Roles and permissions

Summary

Chapter 7: Visualizing BigQuery Data

Why is data visualization important?

The danger of summary statistics

Making data visualization work for you

Three tools for visualizing BigQuery data

Simple yet basic – Google Data Studio

Getting started

Making a scatterplot in Data Studio

Making a map in Data Studio

Other features of Data Studio

Simple, fairly flexible, but with a cost – Tableau

Getting started

Map charts in Tableau

Create a word cloud in Tableau

Complex but with considerable flexibility – the R programming language

Getting started

Summary

Chapter 8: Google Cloud Pub/Sub

Introduction

Getting started with Cloud Pub/Sub

Cloud Pub/Sub via Google Cloud Console

Cloud Pub/Sub via Google Cloud SDK

Cloud Pub/Sub pricing

Message output formats

Importing message data into BigQuery

Google Cloud Dataprep

Summary

Further reading

Index

The users who browse this book also browse


No browse record.