Skip to main content

Glossary

Glossary of Terms associated with databricks

 

blob storage - Azure term for inexpensive storage of any type of file.  Amazon term is S3

compute - one or more computers (virtual machines) used to perform work such as ingesting data or querying data.

control plane - the "brains" of DBX.  The DBX UI is in the control plane and issues commands to the node(s) in the data plane

data plane - location of the user's data - typically in blob storage container in an Azure storage account

DBX - abbreviation of databricks

parquet files - columnar data file format that is compressed.  Contains column headers, data types, and some metadata.

S3 - Amazon term for inexpensive storage of any type of file.  Azure term is blobs or blob storage