July 9, 2019
by Cosmin Laslau
An increasingly common approach in machine learning is to represent words, sentences, and documents as vectors in high-dimensional spaces, in part because doing so does not require hand-labeled data. Now, materials scientists from Berkeley (with backing from Toyota) have shown, in a study published in "Nature", that a similar approach can model scientific knowledge by mining already-published academic papers. Impressively, they showed that the approach hints at discoveries years before they are actually made. With the approach backtested, the next test will be to apply it going forward to enable new scientific breakthroughs. In the meantime, interested clients can get the code and try it themselves.
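
To make the vector-space idea concrete, here is a minimal sketch in Python of how candidate terms might be ranked by their closeness to a target concept. It is an illustration only, not the study's code: the vectors are invented by hand rather than learned from papers, and the names `EMBEDDINGS`, `material_a`, `material_b`, `material_c`, `cosine_similarity`, and `rank_candidates` are hypothetical. The use of cosine similarity as the closeness measure is also an assumption, chosen because it is a common option for word vectors.

```python
import math

# Hand-made 3-dimensional vectors, invented for illustration only.
# Real embeddings are learned from text and have far more dimensions.
EMBEDDINGS = {
    "thermoelectric": [0.9, 0.1, 0.2],
    "material_a": [0.8, 0.2, 0.1],
    "material_b": [0.1, 0.9, 0.3],
    "material_c": [0.5, 0.5, 0.5],
}


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_candidates(target, candidates, embeddings=EMBEDDINGS):
    """Sort candidate terms by similarity to the target term, highest first."""
    scored = [
        (name, cosine_similarity(embeddings[target], embeddings[name]))
        for name in candidates
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


ranking = rank_candidates(
    "thermoelectric", ["material_a", "material_b", "material_c"]
)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy setup, a candidate that ranks near the top but has never been studied for the target property would be the kind of hint worth following up; in practice the vectors would come from a model trained on a large corpus of abstracts.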


