Which of the following layers of the medallion architecture is most commonly used by data analysts?
A. None of these layers are used by data analysts
B. Gold
C. All of these layers are used equally by data analysts
D. Silver
E. Bronze
A data analyst needs to use the Databricks Lakehouse Platform to quickly create SQL queries and data visualizations. It is a requirement that the compute resources in the platform can be made serverless, and it is expected that data visualizations can be placed within a dashboard.
Which of the following Databricks Lakehouse Platform services/capabilities meets all of these requirements?
A. Delta Lake
B. Databricks Notebooks
C. Tableau
D. Databricks Machine Learning
E. Databricks SQL
A data analyst is attempting to drop a table my_table. The analyst wants to delete all table metadata and data.
They run the following command:
DROP TABLE IF EXISTS my_table;
While the object no longer appears when they run SHOW TABLES, the data files still exist.
Which of the following describes why the data files still exist and the metadata files were deleted?
A. The table's data was larger than 10 GB
B. The table did not have a location
C. The table was external
D. The table's data was smaller than 10 GB
E. The table was managed
After running DESCRIBE EXTENDED accounts.customers;, the following was returned:
Now, a data analyst runs the following command:
DROP accounts.customers;
Which of the following describes the result of running this command?
A. Running SELECT * FROM delta.`dbfs:/stakeholders/customers` results in an error.
B. Running SELECT * FROM accounts.customers will return all rows in the table.
C. All files with the .customers extension are deleted.
D. The accounts.customers table is removed from the metastore, and the underlying data files are deleted.
E. The accounts.customers table is removed from the metastore, but the underlying data files are untouched.
Which of the following should data analysts consider when working with personally identifiable information (PII) data?
A. Organization-specific best practices for PII data
B. Legal requirements for the area in which the data was collected
C. None of these considerations
D. Legal requirements for the area in which the analysis is being performed
E. All of these considerations
Delta Lake stores table data as a series of data files, but it also stores a lot of other information.
Which of the following is stored alongside data files when using Delta Lake?
A. None of these
B. Table metadata, data summary visualizations, and owner account information
C. Table metadata
D. Data summary visualizations
E. Owner account information
Which of the following is an advantage of using a Delta Lake-based data lakehouse over common data lake solutions?
A. ACID transactions
B. Flexible schemas
C. Data deletion
D. Scalable storage
E. Open-source formats
Which of the following benefits of using Databricks SQL is provided by Data Explorer?
A. It can be used to run UPDATE queries to update any tables in a database.
B. It can be used to view metadata and data, as well as view/change permissions.
C. It can be used to produce dashboards that allow data exploration.
D. It can be used to make visualizations that can be shared with stakeholders.
E. It can be used to connect to third-party BI tools.
The stakeholders.customers table has 15 columns and 3,000 rows of data. The following command is run:
After running SELECT * FROM stakeholders.eur_customers, 15 rows are returned. After the command executes completely, the user logs out of Databricks.
After logging back in two days later, what is the status of the stakeholders.eur_customers view?
A. The view remains available and SELECT * FROM stakeholders.eur_customers will execute correctly.
B. The view has been dropped.
C. The view is not available in the metastore, but the underlying data can be accessed with SELECT * FROM delta.`stakeholders.eur_customers`.
D. The view remains available but attempting to SELECT from it results in an empty result set because data in views is automatically deleted after logging out.
E. The view has been converted into a table.
A data analyst created and is the owner of the managed table my_table. They now want to change ownership of the table to a single other user using Data Explorer.
Which of the following approaches can the analyst use to complete the task?
A. Edit the Owner field in the table page by removing their own account
B. Edit the Owner field in the table page by selecting All Users
C. Edit the Owner field in the table page by selecting the new owner's account
D. Edit the Owner field in the table page by selecting the Admins group
E. Edit the Owner field in the table page by removing all access
A data analyst has a managed table table_name in database database_name. They would now like to remove the table from the database and all of the data files associated with the table. The rest of the tables in the database must continue to exist.
Which of the following commands can the analyst use to complete the task without producing an error?
A. DROP DATABASE database_name;
B. DROP TABLE database_name.table_name;
C. DELETE TABLE database_name.table_name;
D. DELETE TABLE table_name FROM database_name;
E. DROP TABLE table_name FROM database_name;
A data analyst has recently joined a new team that uses Databricks SQL, but the analyst has never used Databricks before. The analyst wants to know where in Databricks SQL they can write and execute SQL queries.
On which of the following pages can the analyst write and execute SQL queries?
A. Data page
B. Dashboards page
C. Queries page
D. Alerts page
E. SQL Editor page
A data analyst runs the following command:
SELECT age, country
FROM my_table
WHERE age >= 75 AND country = 'canada';
Which of the following tables represents the output of the above command?
A
B
C
D
E
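The answer tables for this question are not reproduced here, so the following is only a minimal sketch of how the WHERE clause filters rows. It uses Python's built-in sqlite3 module as a stand-in for Databricks SQL, and the rows in my_table are hypothetical sample data invented for illustration. An ORDER BY is added (it is not in the original query) so the printed order is deterministic.

```python
import sqlite3

# In-memory SQLite database as a stand-in for a Databricks SQL warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE my_table (age INTEGER, country TEXT)")

# Hypothetical sample rows, not taken from the question.
conn.executemany(
    "INSERT INTO my_table VALUES (?, ?)",
    [(80, "canada"), (75, "canada"), (74, "canada"), (90, "usa"), (82, "Canada")],
)

# Both conditions must hold: age at least 75 AND country exactly 'canada'.
rows = conn.execute(
    "SELECT age, country FROM my_table "
    "WHERE age >= 75 AND country = 'canada' "
    "ORDER BY age"
).fetchall()

print(rows)
```

With this sample data, only the two rows that satisfy both conditions come back: the boundary value 75 is included because the comparison is >=, while the row with 'Canada' is excluded because the string comparison is case-sensitive.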
A data analyst runs the following command:
INSERT INTO stakeholders.suppliers TABLE stakeholders.new_suppliers;
What is the result of running this command?
A. The suppliers table now contains both the data it had before the command was run and the data from the new_suppliers table, and any duplicate data is deleted.
B. The command fails because it is written incorrectly.
C. The suppliers table now contains both the data it had before the command was run and the data from the new_suppliers table, including any duplicate data.
D. The suppliers table now contains the data from the new_suppliers table, and the new_suppliers table now contains the data from the suppliers table.
E. The suppliers table now contains only the data from the new_suppliers table.
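The append behavior of INSERT INTO can be tried out locally. The sketch below uses Python's built-in sqlite3 module as a stand-in, which is an assumption: SQLite does not accept the `INSERT INTO ... TABLE ...` form, so the equivalent `INSERT INTO ... SELECT * FROM ...` is swapped in. The column names and rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database as a stand-in for Databricks SQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE suppliers (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE new_suppliers (id INTEGER, name TEXT)")

# Hypothetical rows; (2, 'Globex') appears in both tables on purpose.
conn.executemany("INSERT INTO suppliers VALUES (?, ?)", [(1, "Acme"), (2, "Globex")])
conn.executemany("INSERT INTO new_suppliers VALUES (?, ?)", [(2, "Globex"), (3, "Initech")])

# SQLite spelling of the append; the source table is only read, never modified.
conn.execute("INSERT INTO suppliers SELECT * FROM new_suppliers")

suppliers = conn.execute("SELECT id, name FROM suppliers ORDER BY id").fetchall()
new_suppliers = conn.execute("SELECT id, name FROM new_suppliers ORDER BY id").fetchall()

print(suppliers)
print(new_suppliers)
```

In this sketch the target table keeps its original rows, gains every row from the source, and performs no de-duplication, so the shared row shows up twice. The source table is unchanged.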
A business analyst has been asked to create a data entity/object called sales_by_employee. It should always stay up-to-date when new data are added to the sales table. The new entity should have the columns sales_person, which will be the name of the employee from the employees table, and sales, which will be all sales for that particular sales person. Both the sales table and the employees table have an employee_id column that is used to identify the sales person.
Which of the following code blocks will accomplish this task?
A
B
C
D
E
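The code blocks for the options above are not reproduced here. As a rough illustration of an object that stays up to date when the sales table changes, the sketch below defines a view over a join, using Python's built-in sqlite3 module as a stand-in for Databricks SQL. The employees.name column, the sample rows, and the choice to return one row per sale (rather than an aggregate) are all assumptions made for illustration.

```python
import sqlite3

# In-memory SQLite database as a stand-in for Databricks SQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (employee_id INTEGER, name TEXT)")
conn.execute("CREATE TABLE sales (employee_id INTEGER, sales REAL)")

# Hypothetical sample data.
conn.executemany("INSERT INTO employees VALUES (?, ?)", [(1, "Ana"), (2, "Raj")])
conn.executemany("INSERT INTO sales VALUES (?, ?)", [(1, 100.0), (2, 50.0)])

# A view stores only the query, so it is re-evaluated on every read.
conn.execute(
    """
    CREATE VIEW sales_by_employee AS
    SELECT e.name AS sales_person, s.sales
    FROM sales AS s
    JOIN employees AS e ON s.employee_id = e.employee_id
    """
)

def view_rows():
    """Read the view in a deterministic order."""
    return conn.execute(
        "SELECT sales_person, sales FROM sales_by_employee "
        "ORDER BY sales_person, sales"
    ).fetchall()

before = view_rows()
conn.execute("INSERT INTO sales VALUES (1, 25.0)")  # new data arrives in sales
after = view_rows()

print(before)
print(after)
```

The second read includes the newly inserted sale without the view being recreated, which is the property the question asks for. A table created from the same query would instead hold a snapshot taken at creation time.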
A data analyst has been asked to use the table sales_table below to get the percent rank of products within each region by sales:
The result of the query should look like this:
Which of the following queries will accomplish this task?