Computers and Technology, 20.07.2021 07:50, anggar20

Please complete the following instructions:

1. Download the Chicago_Crimes_Assign_4500Bal.csv dataset.

2. Create an Orange workflow that will do the following:

Ingest the Chicago_Crimes_Assign_4500Bal.csv dataset.

Preprocess the data to retrieve the most relevant 2 features.

Continuize the discrete categorical variables as numerical. See the video for which option to select here.

Create a k-Means module and let the number of clusters be chosen by the Silhouette score.

Create a Silhouette Plot.

Create a Scatter Plot and compare the two features. Color by cluster.
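To make the "number of clusters chosen by the Silhouette score" step more concrete, here is a minimal pure-Python sketch of the idea: run k-means for several values of k and keep the k with the highest mean silhouette. It is not what Orange runs internally. The toy points, the simplified farthest-point initialization, and all function names are assumptions made for illustration.

```python
import math

def farthest_point_init(points, k):
    # Simplified deterministic initialization (an assumption for this sketch):
    # start at the first point, then repeatedly add the point farthest
    # from all centers chosen so far.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers

def kmeans(points, k, max_iter=100):
    # Plain Lloyd iterations: assign each point to its nearest center,
    # then move each center to the mean of its members.
    centers = farthest_point_init(points, k)
    labels = []
    for _ in range(max_iter):
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(sum(axis) / len(members) for axis in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center in place
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def silhouette_samples(points, labels):
    # s = (b - a) / max(a, b), where a is the mean distance to the point's own
    # cluster and b is the mean distance to the nearest other cluster.
    scores = []
    for i, p in enumerate(points):
        by_cluster = {}
        for j, q in enumerate(points):
            if j != i:
                by_cluster.setdefault(labels[j], []).append(math.dist(p, q))
        own = by_cluster.pop(labels[i], None)
        if not own or not by_cluster:
            scores.append(0.0)  # singleton clusters score 0 by convention
            continue
        a = sum(own) / len(own)
        b = min(sum(d) / len(d) for d in by_cluster.values())
        scores.append((b - a) / max(a, b))
    return scores

# Hypothetical toy data: three well-separated groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (5, 10), (5, 11), (6, 10), (6, 11)]

mean_scores = {}
for k in range(2, 6):
    samples = silhouette_samples(points, kmeans(points, k))
    mean_scores[k] = sum(samples) / len(samples)

best_k = max(mean_scores, key=mean_scores.get)
print("best k:", best_k)
```

The per-point values returned by `silhouette_samples` are the same kind of quantity a silhouette plot draws as bars, so the lowest bars in each cluster are the points with the weakest cohesion.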

Answer the following questions in a Word document:

How many clusters did your final result produce? What was the Silhouette score for the optimal number of clusters?

What were the 2 features chosen by your preprocessing?

What can you say about the scatter plot produced? Think about how the categorical variables are transformed into numbers. You don't need to know what the values are that are encoded to make observations about the relationships between the variables.

Try switching to a few different numbers of clusters. Look at your scatter plot. Does this make more sense or less?

Using the Silhouette Plot at the same time as the Scatter Plot, how many points in each cluster rank at the bottom for cohesion? You can highlight them to see them on the scatter plot (see video). What can you say about these data points in each cluster?
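For the scatter plot question about how categorical variables are transformed into numbers, the sketch below contrasts two common encodings. Which option the assignment expects is shown in the video and is not stated here, so both functions and the sample category names are purely illustrative assumptions.

```python
def ordinal_encode(values):
    # Map each distinct category to an integer code (alphabetical order here).
    categories = sorted(set(values))
    index = {c: i for i, c in enumerate(categories)}
    return [index[v] for v in values]

def one_hot_encode(values):
    # One 0/1 column per distinct category.
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values]

# Hypothetical category values, not taken from the actual dataset.
crime_types = ["THEFT", "BATTERY", "THEFT", "NARCOTICS"]

ordinal = ordinal_encode(crime_types)
one_hot = one_hot_encode(crime_types)
print(ordinal)
print(one_hot)
```

With integer codes, every row lands on one of a few fixed positions along an axis, so a scatter plot of two encoded categorical features tends to look like a grid of stacked points. The spacing between codes reflects the encoding order, not any real distance between categories.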

3. Open your workflow from the Chicago Crimes Classification Assignment. You can choose the Undersampled or Oversampled version (note that the Oversampled version will take longer for the Neural Network to train).

Add the Neural Network model widget.

Configure the Neural Network as follows:

Give it 2 hidden layers of 50 neurons each.

Make the Activation function: ReLU

Make the Solver: Adam

Regularization: leave as is at 0.0001

Maximal number of iterations: 100

Ensure replicable training is checked

Connect the Neural Network widget to the training data as an input and to Test & Score as an output (see video).

Connect the Neural Network output to the Predict widget.

Compare the new results in Test & Score, Confusion Matrix and ROC Score.
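As a rough script-level counterpart to the widget settings above, the sketch below configures scikit-learn's `MLPClassifier` with the same values. Orange's Neural Network widget is documented as building on scikit-learn's multi-layer perceptron, but the exact mapping is an assumption here, as is treating "replicable training" as a fixed `random_state`. The synthetic data from `make_classification` is a stand-in, not the Chicago crimes dataset.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score, confusion_matrix, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

# Synthetic stand-in data (assumption): 600 rows, 10 numeric features, 2 classes.
X, y = make_classification(n_samples=600, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

clf = MLPClassifier(
    hidden_layer_sizes=(50, 50),  # 2 hidden layers of 50 neurons each
    activation="relu",            # Activation: ReLU
    solver="adam",                # Solver: Adam
    alpha=0.0001,                 # Regularization left at 0.0001
    max_iter=100,                 # Maximal number of iterations: 100
    random_state=0,               # fixed seed, assumed analogue of replicable training
)
clf.fit(X_train, y_train)  # may emit a ConvergenceWarning at 100 iterations

pred = clf.predict(X_test)
proba = clf.predict_proba(X_test)[:, 1]
cm = confusion_matrix(y_test, pred)

print("accuracy:", accuracy_score(y_test, pred))
print("ROC AUC:", roc_auc_score(y_test, proba))
print(cm)
```

Accuracy, the confusion matrix, and ROC AUC correspond loosely to what Test & Score, Confusion Matrix, and the ROC view report, so the same comparison against the other models can be made from these numbers.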

In the Word Document you created for clustering, answer these questions:

List the results in the Word Document.

Did the Neural Network model perform better or worse than the other models?

Why do you think it performed better or worse?

Send me your email for the link!
