Search results
80 high-resolution aerial images with spatial resolution ranging from 0.3 to 1.0. Images manually segmented. Instances: 80. Format: images. Default task: aerial classification, object detection. Created: 2013. [156] [157] Creator: J. Yuan et al.
KIT AIS Data Set: Multiple labeled training and evaluation datasets of aerial images of crowds. Images manually labeled to show paths of individuals through ...
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC. Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
A 3.1 TB dataset consisting of permissively licensed source code in 30 programming languages. Filtered through license detection and deduplication. Size: 6 TB, 51.76B files (prior to deduplication); 3 TB, 5.28B files (after); 358 programming languages. Format: Parquet. Default task: language modeling, autocompletion, program synthesis. Created: 2022. [402] [403]
The Database of Original & Non-Theoretical Uses of Topology (DONUT) is a database of scholarly articles featuring practical applications of topological data analysis to various areas of science. DONUT was started in 2017 by Barbara Giunti, Janis Lazovskis, and Bastian Rieck, [126] and as of October 2023 contains 447 articles. [127]
The set of images in the MNIST database was created in 1994. Previously, NIST had released two datasets: Special Database 1 (NIST Test Data I, or SD-1) and Special Database 3 (or SD-3). They were released on two CD-ROMs. SD-1 was the test set; it contained 58,646 images of digits written by 500 different writers, all high school students.
Google Dataset Search is a search engine from Google that helps researchers locate online data that is freely available for use. [1] The company launched the service on September 5, 2018, and stated that the product was targeted at scientists and data journalists. The service was out of beta as of January 23, 2020. [2]
Spark Core is the foundation of the overall project. It provides distributed task dispatching, scheduling, and basic I/O functionalities, exposed through an application programming interface (for Java, Python, Scala, .NET [16] and R) centered on the RDD abstraction (the Java API is available for other JVM languages, but is also usable for some other non-JVM languages that can connect to the ...
The above diagram shows which Linking Open Data datasets are connected, as of August 2014. It was produced by the Linked Open Data Cloud project, which was started in 2007. Some sets may include copyrighted data which is freely available. [15] The same diagram as above, but for February 2017, showing the growth in just two and a half years.