Download (right-click, save target as ...) this page as a JupyterLab notebook from: ES-1


CE 5319 Machine Learning for Civil Engineers
Fall 2022 Exercise Set 2

LAST NAME, FIRST NAME

R00000000


Purpose:

Obtain databases for use in class and homework exercises.

Assessment Criteria:

Completion, plausible results, correct format, and calculations shown (in the Jupyter Notebook).


Problem 1

Obtain the Concrete Strength Database and perform a rudimentary exploratory data analysis and summary.

The source database is at: https://archive.ics.uci.edu/ml/machine-learning-databases/concrete/compressive/Concrete_Data.xls

Note that it is an Excel file, so you will need to use pandas to read it into a dataframe.
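One possible starting point for Problem 1 is sketched below. It is a minimal sketch, not a required approach: the helper names `load_concrete` and `summarize` are hypothetical, and reading a legacy `.xls` file with `pandas.read_excel` assumes the optional `xlrd` package is installed.

```python
import pandas as pd

CONCRETE_URL = (
    "https://archive.ics.uci.edu/ml/machine-learning-databases/"
    "concrete/compressive/Concrete_Data.xls"
)


def load_concrete(source=CONCRETE_URL):
    """Read the Excel file (URL or local path) into a dataframe.

    Assumption: xlrd is installed, which pandas needs for .xls files.
    """
    return pd.read_excel(source)


def summarize(df):
    """Collect a few rudimentary exploratory summaries of a dataframe."""
    return {
        "shape": df.shape,              # (rows, columns)
        "dtypes": df.dtypes,            # data type of each column
        "missing": df.isna().sum(),     # count of missing values per column
        "stats": df.describe().T,       # count, mean, std, min, quartiles, max
    }


# Example use (requires network access, so it is left commented out):
# concrete = load_concrete()
# for name, item in summarize(concrete).items():
#     print(name, item, sep="\n", end="\n\n")
```

Keeping the download and the summary in separate functions lets the summary be rerun on a locally saved copy of the file.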


Problem 2

Obtain the National Bridge Inventory Database and perform a rudimentary data analysis and content summary.

The source database is at: https://www.fhwa.dot.gov/bridge/nbi/2021allstatesallrecsdel.zip

Notice that the URL points to a zip file; after downloading it, you have to extract the contents.

Determine which state code corresponds to Texas.
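The download, extract, and inspect steps for Problem 2 might be sketched as follows, using only the Python standard library. The helper names are hypothetical, and the column name `STATE_CODE_001` in the usage comment is an assumption about the file header; check the header and quoting of the extracted file before relying on it.

```python
import csv
import io
import zipfile
from collections import Counter
from pathlib import Path
from urllib.request import urlopen

NBI_URL = "https://www.fhwa.dot.gov/bridge/nbi/2021allstatesallrecsdel.zip"


def download_bytes(url, timeout=60):
    """Fetch a URL and return its raw bytes."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def extract_zip(zip_bytes, dest_dir):
    """Extract an in-memory zip archive and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        archive.extractall(dest)
        return archive.namelist()


def count_by_column(path, column, quotechar='"', encoding="latin-1"):
    """Count records per distinct value of one column in a delimited file.

    Assumption: the file is comma-delimited with a header row; adjust
    quotechar if the file uses a different text qualifier.
    """
    counts = Counter()
    with open(path, newline="", encoding=encoding) as handle:
        for row in csv.DictReader(handle, quotechar=quotechar):
            counts[row[column].strip()] += 1
    return counts


# Example use (requires network access, so it is left commented out):
# names = extract_zip(download_bytes(NBI_URL), "nbi_2021")
# counts = count_by_column(Path("nbi_2021") / names[0], "STATE_CODE_001")
# print(counts.most_common(5))
```

The record counts per state code can then be compared against a published table of state codes to identify Texas.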


Problem 3

Mirror the Crack Images Database Directory.

The container directory is at: http://54.243.252.9/ce-5319-webroot/1-Databases/MaterialCracking/

The data originate from the paper "Machine Learning Classifiers for Surface Crack Detection in Fracture Experiments".
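Mirroring the directory in Problem 3 could be approached as sketched below. This is a sketch under one main assumption: the server returns a plain HTML index page for each directory, with one link per file or subdirectory. The names `LinkCollector`, `child_links`, and `mirror` are hypothetical.

```python
import shutil
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import unquote, urljoin
from urllib.request import urlopen

BASE_URL = "http://54.243.252.9/ce-5319-webroot/1-Databases/MaterialCracking/"


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def child_links(html, base_url):
    """Split the links on an index page into (files, subdirectories).

    Sort links ("?..."), fragments, and anything outside base_url
    (such as the parent directory) are skipped.
    """
    parser = LinkCollector()
    parser.feed(html)
    files, dirs = [], []
    for href in parser.hrefs:
        if href.startswith(("?", "#")):
            continue
        full = urljoin(base_url, href)
        if full == base_url or not full.startswith(base_url):
            continue
        (dirs if full.endswith("/") else files).append(full)
    return files, dirs


def mirror(url, dest_dir, root_url=None):
    """Recursively copy every file below url into dest_dir."""
    root_url = root_url or url
    with urlopen(url, timeout=60) as response:
        html = response.read().decode("utf-8", errors="replace")
    files, dirs = child_links(html, url)
    for file_url in files:
        target = Path(dest_dir) / unquote(file_url[len(root_url):])
        target.parent.mkdir(parents=True, exist_ok=True)
        if not target.exists():  # skip files already mirrored
            with urlopen(file_url, timeout=60) as response, open(target, "wb") as out:
                shutil.copyfileobj(response, out)
    for dir_url in dirs:
        mirror(dir_url, dest_dir, root_url)


# Example use (requires network access, so it is left commented out):
# mirror(BASE_URL, "MaterialCracking")
```

Skipping files that already exist locally means an interrupted run can be restarted without downloading everything again.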


Problem 4

Obtain the two databases related to forest fires; the container directories are:

  1. https://archive-beta.ics.uci.edu/ml/datasets/forest+fires
  2. https://archive-beta.ics.uci.edu/ml/datasets/algerian+forest+fires+dataset

Prepare a written description of these two datasets: what do they contain, and can they be merged into a single larger database (i.e., what are the common features, possibly under different names)?
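The question of common features under different names can be explored with a small helper like the one sketched below. The function names and the alias map are hypothetical, and the column lists are illustrative subsets only; replace them with the headers read from the files you actually download.

```python
def normalize(name, aliases):
    """Lower-case and trim a column name, then map known aliases."""
    key = name.strip().lower()
    return aliases.get(key, key)


def compare_columns(cols_a, cols_b, aliases=None):
    """Report which normalized column names are shared or unique."""
    aliases = aliases or {}
    a = {normalize(c, aliases) for c in cols_a}
    b = {normalize(c, aliases) for c in cols_b}
    return {
        "common": sorted(a & b),
        "only_a": sorted(a - b),
        "only_b": sorted(b - a),
    }


# Illustrative subsets (assumptions; verify against the downloaded files).
first_columns = ["temp", "RH", "wind", "rain", "area"]
second_columns = ["Temperature", " RH", "Ws", "Rain ", "Classes"]

# Hypothetical alias map: second-dataset names mapped to first-dataset names.
aliases = {"temperature": "temp", "ws": "wind"}

report = compare_columns(first_columns, second_columns, aliases)
print(report)
```

Matching names is only a first check; units and measurement definitions also have to agree before the two tables can be merged.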
