Consider two neural networks that map a scalar input x to a scalar output y. The first network is shallow and has D = 95 hidden units. The second is deep and has K = 10 layers, each containing D = 5 hidden units. How many parameters does each network have? How many linear regions can each network make? Which would run faster?

Report
Question

Please briefly explain why you feel this question should be reported.

Report
Cancel

Consider two neural networks that map a scalar input x to a scalar output y. The first network is shallow and
has D = 95 hidden units. The second is deep and has K = 10 layers, each containing D = 5 hidden units. How
many parameters does each network have? How many linear regions can each network make? Which would run
faster?

MathJax Example

Leave an answer

Browse

By answering, you agree to the Terms of Service and Privacy Policy.