[RAS] utf8 reference strings

Jose Manuel Barrueco barrueco at uv.es
Fri Feb 27 04:00:18 EST 2009


 	If nobody has comments or suggestion on that I would suggest to go 
ahead with the change latin-1 to utf8. If we proceede like in CitEc it 
should not be necesary to reload all citations since the proporcion of 
references afected is quite small...


On Sat, 21 Feb 2009, Thomas Krichel wrote:

>  Jose Manuel Barrueco writes
>
>> I've managed to see correct utf8 characters in:
>>
>> CitEc database -> AMF files
>>
>> but now the problem is in the ACIS database.
>
>  A bit of background here. JMBC and I have been working on the issue
>  of citations lost between RAS and CitEc. It appears that there are
>  issues in the character sets of the reference string
>  (refstring). CitEc produced latin-1 refstrings and stuck them into
>  the AMF files. We changed the column of the reference to utf-8.
>
>> There, the character set
>> used in the citations table is still latin1. I've re-processed a
>> document with problems in characters (RePEc:mar:volksw:200425) to test
>> the changes. Before, the characters were ok in ACIS but wrong in CitEc.
>> Not we have the problem in the other side. Try for instance:
>>
>> mysql> select clid,cnid,ostring from citations where ostring like
>> "SALMON, P. (2003), As%";
>>
>> Should we change the character set for ACIS too?
>
>  I think this will have to be done.  I am not sure how it is
>  best to be done and hope that Ivan can advice. We can change
>  the columns to utf-8 and reload all the citations. Maybe at
>  this stage we will remove the link to the citations screen
>  temporarily so that we have a chance to test things.
>
>  Cheers,
>
>  Thomas Krichel                    http://openlib.org/home/krichel
>                                RePEc:per:1965-06-05:thomas_krichel
>                                               skype: thomaskrichel
>
>


---
José Manuel Barrueco				http://www.uv.es/=barrueco


More information about the RAS-run mailing list