All code should be run through Jupyter Notebook.
- Configure your Jupyter notebook to use Python 3.9.2
- Run the pip install code snippet in the test.ipynb file to install all of the necessary modules.
- Ensure the modules install successfully; if they do not, use the error message to resolve the problem.
- Open test.ipynb
- Run the first code snippet to set up autoreload
- Run the second code snippet (the pip install one) to install packages, if not already done
- Run the third code snippet to import the relevant custom libraries
- Run either the fourth or the fifth code snippet
- The fourth code snippet uses a large dataset for real-life code execution (school_report)
- The fifth code snippet uses a small dataset for quick code execution (500spend)
- After running the desired code snippet, output files are generated showing the state of the table at each major step
- The final step produces the 5 - PostSubsumption... files.
- A CSV with the final table is provided along with a .txt file with general information on the table
- The console output in the notebook should provide the execution time
- The sixth and seventh code snippets graph information from individual runs
- Open test_suite.ipynb
- Run the first code snippet to install packages, if not already done
- Run the rest of the code snippets sequentially to obtain run statistics and graphs
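
Before running either notebook, it can help to confirm which interpreter the Jupyter kernel is actually using. The snippet below is a minimal sketch, not part of the repository: `check_python_version` is a hypothetical helper, and treating any 3.9.x release as close enough to 3.9.2 is an assumption rather than a documented requirement.

```python
import sys

# Version named in the setup steps above.
EXPECTED = (3, 9, 2)


def check_python_version(expected=EXPECTED, actual=None):
    """Return (ok, message). Hypothetical helper, not part of this repo.

    Assumption: a matching major.minor (3.9) is treated as compatible,
    even if the patch number differs from 3.9.2.
    """
    if actual is None:
        # Use the interpreter that is running this cell.
        actual = tuple(sys.version_info[:3])
    ok = tuple(actual[:2]) == tuple(expected[:2])
    found = ".".join(str(part) for part in actual)
    wanted = ".".join(str(part) for part in expected)
    if ok:
        message = f"Python {found} is compatible with the expected {wanted}."
    else:
        message = f"Python {found} found; the notebooks expect {wanted}."
    return ok, message


ok, message = check_python_version()
print(message)
```

Running this in a new first cell of test.ipynb or test_suite.ipynb reports the kernel's version; if it does not match, switch the notebook kernel before running the pip install snippet.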
Original paper to be implemented - https://www.vldb.org/pvldb/vol16/p932-khatiwada.pdf
Original paper GitHub repository - https://github.com/northeastern-datalab/alite
Original authors' presentation - https://youtu.be/4c6SYCwQ7uc?si=qE36Hm70qaJAa8Hz
Origin of Full Disjunction? - https://dl.acm.org/doi/10.1145/191839.191908
Other FD paper - https://dl.acm.org/doi/10.1145/237661.237717
IncrementalFD - https://www.sciencedirect.com/science/article/pii/S0022000006001449
ParaFD - https://www.sciencedirect.com/science/article/pii/S2214579618303137
BIComNLoj - https://dl.acm.org/doi/10.5555/1182635.1164191
Use data located here - https://drive.google.com/drive/folders/1yUgL8TjQievzp8zvmHLpa_ClNzc5mTmD