Skip to content

match scores very sensitive to repetition #8

@tylerandrewscott

Description

@tylerandrewscott

This case below shows the searched reference and the matched reference. Because the XXX structure shows up a lot in the searched authors, the match score is super high (>220) because the author in the matched reference is listed as "X X". repetition shouldn't necessarily matter for citations, so perhaps there's a way to adjust how this search is scored

                                                      title
                                                                                                                                             
1: Coast Regional Water Quality Control Board in cooperation with the California Department of Forestry;Pinel-Alloul, and E.E;Pinel-Alloul, and E.E
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  authors
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   
1: S. Nakano;M. Murakami;X.X. Neatrour;J.R.Webster M.A.;E.F. Benfield;X.X. Negishi;NA J.N.;J.S. Richardson;X.X. Neill;J.S.Buska C.R.;E.F. Chacho;C.M. Collins;L.W. Gatto;NA NA;X.X. Nicholls;R.J.Steedman K.H.;E.C. Carney;X.X. Nislow;NA K.H.;W.H. Lowe;NA NA;X.X.X. O’Brien;NA W.J.;J.J. Showalter;X.X.X.X. O’Connor;M.A.Jones J.E.;T.L. Haluska;X.X.X.X.X. O’Connor;NA M.D.;R.R. Ziemer;X.X.X.X. O'Loughlin;NA C.;R.R. Ziemer;X.X. Olsen;W.J.Spearman J.B.;G.K. Sage;S.J. Miller;B.G. Flannery;J.K. Wenburg;X.X.Oregon Forest Industries Council;Washington Forest Protection Association;X.X.X.X. Orlikowska;E. H;X.X. Oswood;M.W. NA;X.X. Oswood;A.M.Milner M.W.;J.G. Irons;X.X. Ott;R.A. NA;X.X. Ott;M.A.Lee R.A.;W.E. Putman;O.K. Mason;G.T. Worum;D.N. Burns;NA NA;NA NA;X.X.X. Patoine;B.Pinel-Alloul A.;E.E. Prepas;R. Carignan
   publisher   year journal_title    doi                            File  uq_id
                                            
1:         2001        Prepas    38021_84886_FSPLT3_4442884.json 112409
   match.title                              match.doi match.authors
                                                 
1:     Sumário https://doi.org/10.21874/rsp.v46i1.724           X X
                                   match.publisher match.year
                                                  
1: Escola Nacional de Administracao Publica (ENAP)       2015
          match.journal_title match.source match.score       oa_id
                                           
1: Revista do Serviço Público     openalex    222.8552 W4241664417 

Metadata

Metadata

Labels

enhancementNew feature or requestinvalidThis doesn't seem right

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions