Publication Date
11-26-2019
Resource Type
Dataset
Abstract
This archive contains queries that capture information in different search contexts. The first file includes those written by children between the 3rd - 6th grade levels, while performing search tasks. We collected and archived this data between the April 2017 -- December 2018, based on Boise State University's IRB approval. We also include simulated queries we extracted from children's reviews. Additional columns in this dataset are children's grade levels, the query source, and the query type (i.e., if it is a keyword, phrase, or question query). The other files are comprised of queries that are meant to lead to the retrieval of (1) educational, (2) sexually explicit, and (3) hate-based resources.
DOI
10.18122/cs_scripts/7/boisestate
Use Restrictions
CC-BY-NC-ND
Disclaimer of Warranty
BOISE STATE UNIVERSITY MAKES NO REPRESENTATIONS ABOUT THE SUITABILITY OF THE INFORMATION CONTAINED IN OR PROVIDED AS PART OF THE SYSTEM FOR ANY PURPOSE. ALL SUCH INFORMATION IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND. BOISE STATE UNIVERSITY HEREBY DISCLAIMS ALL WARRANTIES AND CONDITIONS WITH REGARD TO THIS INFORMATION, INCLUDING ALL WARRANTIES AND CONDITIONS OF MERCHANTABILITY, WHETHER EXPRESS, IMPLIED OR STATUTORY, FITNESS FOR A PARTICULAR PURPOSE, TITLE AND NON-INFRINGEMENT.
IN NO EVENT SHALL BOISE STATE UNIVERSITY BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF INFORMATION AVAILABLE FROM THE SYSTEM.
THE INFORMATION PROVIDED BY THE SYSTEM COULD INCLUDE TECHNICAL INACCURACIES OR TYPOGRAPHICAL ERRORS. CHANGES ARE PERIODICALLY ADDED TO THE INFORMATION HEREIN. COMPANY AND/OR ITS RESPECTIVE SUPPLIERS MAY MAKE IMPROVEMENTS AND/OR CHANGES IN THE PRODUCT(S) AND/OR THE PROGRAM(S) DESCRIBED HEREIN AT ANY TIME, WITH OR WITHOUT NOTICE TO YOU.
BOISE STATE UNIVERSITY DOES NOT MAKE ANY ASSURANCES WITH REGARD TO THE ACCURACY OF THE RESULTS OR OUTPUT THAT DERIVES FROM USE OF THE SYSTEM.
Recommended Citation
Anuyah, Oghenemaro; Milton, Ashlee; Green, Michael; and Pera, Maria Soledad, "Data Set for An Empirical Analysis of Search Engines’ Response to Web Search Queries Associated with the Classroom Setting" (2019). Computer Science Faculty Scripts and Data. 7.
https://scholarworks.boisestate.edu/cs_scripts/7
Comments
Funding Information
National Science Foundation grant with Award number: 1565937
Data Attributes
.json format that includes children's queries, grade levels, the query source, and the query type (i.e., if it is a keyword, phrase, or question query).
Human Subjects: Yes
IRB Approval Number: 131-SB16-103
Files
Children Queries:
kids_queries_dataset_classified_with_grades.json
Educational Queries: related to health and science subjects: EduQRY.csv
Sexually Explicit Queries extracted from [Google's bad word list](https://code.google.com/archive/p/badwordslist/downloads): BwQRY.csv
Hate-based queries extracted from [hateBase](https://hatebase.org/) and [Davidson et al. (2017)](https://www.aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/viewPaper/15665): HsQRY.csv