Difference between revisions of "Step 1"

From Research management course
Jump to: navigation, search
Line 45: Line 45:
 
==Papers to choose from==  
 
==Papers to choose from==  
 
Please read a paper from this list and formulate its main message. Imagine you are a journal editor or a reliever, who receives scientific papers randomly and pick up some paper.  
 
Please read a paper from this list and formulate its main message. Imagine you are a journal editor or a reliever, who receives scientific papers randomly and pick up some paper.  
 +
# Distinguishing time-delayed causal interactions using convergent cross mapping [https://doi.org/10.1038/srep14750 DOI]
 +
# Comprehensive study of feature selection methods to solve multicollinearity problem according to evaluation criteria [https://doi.org/10.1016/j.eswa.2017.01.048 DOI], [https://m1p.org/papers/Katrutsa2016QPFeatureSelection.pdf PDF]
 
#  
 
#  
 
#  
 
#  
Line 52: Line 54:
 
#  
 
#  
 
#  
 
#  
#  
+
# How much does it help to know what she knows you know? An agent-based simulation study [http://dx.doi.org/10.1016/j.artint.2013.05.004 DOI]
#
 
#
 
  
 
===Can I select a paper of my own choice?===
 
===Can I select a paper of my own choice?===

Revision as of 01:58, 20 September 2024

The most important things come first. We discuss the main message, delivered by a scientific paper. We explore the first three elements of a scientific paper: the abstract, the highlights, and the keywords. The main message shall reveal itself through all elements of the paper. But we leave the rest of it for the next time. Namely, the title, introduction, problem statement, goal of the computational experiment, and conclusion are left behind. We select a paper and exercise in the reconstruction of these three elements.

The seminar

  1. The warm-up 3-minute test
  2. Model, Algorithm, Method: Machine learning in a nut-shell
  3. Step 0 homework results discussion
  4. Step 1 homework, how to read (how to search is a separate topic)
  5. Structure of the main message
  6. Structure of the abstract
  7. The second and the last slide of your talk
  8. Extracting keywords
  9. Highlights: compressing the paper
  10. Instastructure for your homework
    • GitHub: organize the repository
    • LaTeX: compile your file and commit without temporary files
  11. The papers to select from
  12. Optional GPT-role discussion

Resources

Step 1 Youtube video (expected with online version)

Homework

  1. Set up your GitHub repository using this template, see how
  2. Select a paper to read from the list below
  3. Reconstruct its
    1. Abstract
    2. Keywords
    3. Highlights
    4. Short motivation for why you selected this paper (no templates here, since it is an extra topic to discuss)
  4. Compile and upload TEX and PDF to GitHub (no temporary files, please)
  5. Linear models for the next warm-up test, either
    1. look for the terms dot product, scalar projection, least squares, linear map
    2. or do fun-reading, the pages 33-39 from the book Section L3.

Note that we always respect your credit hours. So please keep track of it.

Your profit here is your ability to find the main message of a paper.

Papers to choose from

Please read a paper from this list and formulate its main message. Imagine you are a journal editor or a reliever, who receives scientific papers randomly and pick up some paper.

  1. Distinguishing time-delayed causal interactions using convergent cross mapping DOI
  2. Comprehensive study of feature selection methods to solve multicollinearity problem according to evaluation criteria DOI, PDF
  3. How much does it help to know what she knows you know? An agent-based simulation study DOI

Can I select a paper of my own choice?

Yes. Here are some formal requirements.

  1. A clear message in the area of Theoretical Foundations of Machine Learning.
  2. No Kaggle-style papers with messages like "It works, but nobody knows how".
  3. Top peer-reviewed journals, no ArXiv, better avoid conferences.
  4. No papers from other fields: linguistics, medicine, finance, physics, etc. There must be only one primary subject: Machine Learning.
  5. No overviews of paper collections, it is another genre.
  6. No predatory publishing houses

Recommended journals

  1. Machine Learning
  2. Expert Systems with Applications
  3. Journal of Machine Learning Research
  4. Artificial Intelligence
  5. Neurocomputing
  6. IEEE Transactions on Pattern Analysis and Machine Intelligence
  7. Neural Networks
  8. Pattern Recognition
  9. Data Mining and Knowledge Discovery
  10. Nature Machine Inlelligence, the problem is the first word here is Nature so it focuses on natural sciences

See also

  1. Elsevier's open archive
  2. Springer's open archive


Transcript of the video

Appears after the seminar.