Final Project Guidelines
Choosing A Topic
In this project, you have two options for the general route you can take:
- As a first option, you may select a bioengineering paper that implements a computational analysis we have learned about from the literature. You will reimplement this analysis with thorough testing and documentation. In your final report, you should include a discussion about any elements that presented a challenge during your reimplementation, or discrepancies you found. Lastly, you should identify a creative extension of the analysis and implement it.
- Alternatively, you may identify a dataset and corresponding analysis that has not yet been performed. This can be more exploratory in nature. You will implement this analysis with testing and documentation. In your final report, you should discuss whether/how the type of data influenced the analysis that was possible, the findings and limitations of your analysis, and what might be ways to validate your findings. Some repositories with potential datasets are listed below.
There are no restrictions on the programming language or packages you use. Projects can be individual or in teams of two with the scope of the project adjusted accordingly. If in a team, your proposal, final project, and final presentation will be jointly submitted and should cover how the project was divided.
Proposal
The proposal should be one page in length and explain the following:
- Why the topic you chose is interesting and solves a bioengineering-related challenge
- How your project fits one of the two topic criteria listed above
- What overall approach do you plan to take for the project and why
- How your project can be finished within a month
- The difficulty of your project
- The expected outcomes of your analysis
- If in a group, how the project will be split up among members
We are available to discuss your ideas whenever you are ready, and you should discuss your idea with us prior to submitting your proposal. By the time you submit your proposal you should ensure that you have access to the requisite data. We are happy to help you with this.
Final Report
Your final report should be less than 1500 words and describe the following items:
- Introduction/Motivation:
- Why the topic you choose is interesting
- Whether similar work has been done by others (novelty will not affect your grade, but it is always good to know if other people are doing the same thing)
- Problem definition
- How to formulate your topic/question into a data analytics problem
- How does it relate to a broader biomedical challenge
- Methods
- Description of the algorithm you employed or designed
- The software package you chose or your own implementation of the algorithm
- How to use the software package or your own code
- (Optional) Charting and/or visualization that help make decisions in your analysis
- Results
- Quantitative evaluation of your method
- Charting and/or visualization of your results
- How your proposed approach has solved your question
Your analysis code should accompany your final report.
Final Presentation
Each presentation will have 10 minutes:
- 8 minutes for a presentation
- 2 minutes for questions, discussion, and transition to the next presenter
Each team should prepare no more than 8 slides, excluding the title slide, and cover the sections listed in the final report.
Grading
Your project will be evaluated
- Proposal (15%)
- Final Report (40%)
- Final Presentation (15%)
- Code and Implementation (30%)
Final Project Source Ideas
- Many interesting datasets are listed in the UC Irvine Machine Learning Repository.
- Machine learning competitions, such as the DREAM Challenges, Kaggle, and DrivenData, may have relevant challenges and pre-assembled datasets.
- You are welcome to use a dataset from your own research. There is no problem keeping the data confidential, and this does not create any issues related to authorship. If you chose to do so, you should chose a new analysis approach that has not yet been performed.