GithubHelp home page GithubHelp logo

cs120_lab2_linear_regression_df.py - randomSplit changed result in Databrick may cause inconsistent test cases about mooc-setup HOT 2 CLOSED

spark-mooc avatar spark-mooc commented on August 14, 2024
cs120_lab2_linear_regression_df.py - randomSplit changed result in Databrick may cause inconsistent test cases

from mooc-setup.

Comments (2)

brett--anderson avatar brett--anderson commented on August 14, 2024

Adding to this, I'm trying to complete the archived course on edx. I'm getting n_train: 5405, n_val: 644, n_test: 675

All prior tests in the lab are passing. I've read through the piazza discussion, and though a similar problem was reported here, none of the suggestions have helped. I've also tried rewriting my answers to prior sections in numerous ways and I still end up with the same partitioning.

The suggested fix might get the test in question to pass, but subsequent tests which check averages in each set fail and presumably much of the rest of the lab will have mismatched results.

Perhaps this should be changed to something that isn't actually random, some previously saved random set of indexes that can be loaded from a txt file or something?

from mooc-setup.

micksatana avatar micksatana commented on August 14, 2024

It's been a long time since I posted this issue. I can't really remember what it is now.

In general, I quite agree with Brett on test cases shouldn't be random. But if, for whatever reason, the test needed to have random value, then all the subsequest tests should also be fixed. Or maybe re-design the lab ;)

Anyway, I'll close this issue. It seems the course owner is not interested enough to fix it.

from mooc-setup.

Related Issues (11)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.