DuckDB in Action

Author : Mark Needham
Publisher : Simon and Schuster
Total Pages : 310
Release : 2024-09-10
ISBN-10 : 1638355592
ISBN-13 : 9781638355595
Rating : 4/5 (95 Downloads)

Book Synopsis DuckDB in Action by : Mark Needham

Download or read book DuckDB in Action written by Mark Needham and published by Simon and Schuster. This book was released on 2024-09-10 and has 310 pages. Available in PDF, EPUB and Kindle.

Book excerpt: Dive into DuckDB and start processing gigabytes of data with ease, all with no data warehouse. DuckDB is a cutting-edge SQL database that makes it incredibly easy to analyze big data sets right from your laptop. In DuckDB in Action you’ll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save hundreds on your cloud bill. From data ingestion to advanced data pipelines, you’ll learn everything you need to get the most out of DuckDB, all through hands-on examples.

Open up DuckDB in Action and learn how to:
• Read and process data from CSV, JSON and Parquet sources, both local and remote
• Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
• Use DuckDB from Python, both with SQL and its "Relational" API, interacting with databases as well as data frames
• Prepare, ingest and query large datasets
• Build cloud data pipelines
• Extend DuckDB with custom functionality

Pragmatic and comprehensive, DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. You won’t need to read through pages of documentation; you’ll learn as you work. Get to grips with DuckDB's unique SQL dialect, learning to seamlessly load, prepare, and analyze data using SQL queries. Extend DuckDB with both Python and built-in tools such as MotherDuck, and gain practical insights into building robust and automated data pipelines.

About the technology
DuckDB makes data analytics fast and fun! You don’t need to set up a Spark cluster or run a cloud data warehouse just to process a few hundred gigabytes of data. DuckDB is easily embeddable in any data analytics application, runs on a laptop, and processes data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres.

About the book
DuckDB in Action guides you example-by-example from setup, through your first SQL query, to advanced topics like building data pipelines and embedding DuckDB as a local data store for a Streamlit web app. You’ll explore DuckDB’s handy SQL extensions, get to grips with aggregation, analysis, and data without persistence, and use Python to customize DuckDB. A hands-on project accompanies each new topic, so you can see DuckDB in action.

What's inside
• Prepare, ingest and query large datasets
• Build cloud data pipelines
• Extend DuckDB with custom functionality
• Fast-paced SQL recap: From simple queries to advanced analytics

About the reader
For data pros comfortable with Python and CLI tools.

About the author
Mark Needham is a blogger and video creator at @LearnDataWithMark. Michael Hunger leads product innovation for the Neo4j graph database. Michael Simons is a Java Champion, author, and Engineer at Neo4j.


DuckDB in Action Related Books

DuckDB in Action
Language: en
Pages: 310
Authors: Mark Needham
Categories: Computers
Type: BOOK - Published: 2024-09-10 - Publisher: Simon and Schuster

Dive into DuckDB and start processing gigabytes of data with ease—all with no data warehouse. DuckDB is a cutting-edge SQL database that makes it incredibly e
ScyllaDB in Action
Language: en
Pages: 390
Authors: Bo Ingram
Categories: Computers
Type: BOOK - Published: 2024-11-12 - Publisher: Simon and Schuster

Build, maintain, and run databases that are easy to scale and quick to query—all with ScyllaDB. ScyllaDB in Action is your guide to everything you need to kno
Getting Started with DuckDB
Language: en
Pages: 382
Authors: Simon Aubury
Categories: Computers
Type: BOOK - Published: 2024-06-24 - Publisher: Packt Publishing Ltd

Analyze and transform data efficiently with DuckDB, a versatile, modern, in-process SQL database Key Features Use DuckDB to rapidly load, transform, and query d
Spark: The Definitive Guide
Language: en
Pages: 594
Authors: Bill Chambers
Categories: Computers
Type: BOOK - Published: 2018-02-08 - Publisher: "O'Reilly Media, Inc."

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With
Learning Spark
Language: en
Pages: 289
Authors: Holden Karau
Categories: Computers
Type: BOOK - Published: 2015-01-28 - Publisher: "O'Reilly Media, Inc."

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source