[RAS] utf8 reference strings
Jose Manuel Barrueco
barrueco at uv.es
Fri Feb 27 04:00:18 EST 2009
If nobody has comments or suggestions on that, I would suggest going
ahead with the change from latin-1 to utf8. If we proceed as in CitEc, it
should not be necessary to reload all citations, since the proportion of
references affected is quite small...
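
To illustrate why only a small share of references should be affected: a
refstring made up purely of ASCII characters has identical bytes in latin-1
and utf8, so only strings with accented or other non-ASCII characters can
come out garbled. The following is a minimal sketch of that check and of the
typical garbling, not the code used in CitEc or ACIS. The function names and
the sample strings are hypothetical.

```python
# Hypothetical helpers; names and sample data are illustrative only.

def is_affected(refstring: str) -> bool:
    """Return True if the string contains any non-ASCII character.

    Pure-ASCII strings encode to the same bytes in latin-1 and UTF-8,
    so they cannot be damaged by confusing the two character sets.
    """
    return any(ord(ch) > 127 for ch in refstring)


def misread_utf8_as_latin1(text: str) -> str:
    """Simulate storing UTF-8 bytes and reading them back as latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(garbled: str) -> str:
    """Undo the misreading above by reversing the two steps."""
    return garbled.encode("latin-1").decode("utf-8")


if __name__ == "__main__":
    samples = [
        "Smith, J. (2003), Growth and trade",    # ASCII only, unaffected
        "Müller, J. (2001), Über Zitate",        # non-ASCII, affected
    ]
    for s in samples:
        print(is_affected(s), repr(misread_utf8_as_latin1(s)))
```

Under these assumptions, only the rows flagged by something like
is_affected() would need to be re-processed after the column change.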
On Sat, 21 Feb 2009, Thomas Krichel wrote:
> Jose Manuel Barrueco writes
>
>> I've managed to see correct utf8 characters in:
>>
>> CitEc database -> AMF files
>>
>> but now the problem is in the ACIS database.
>
> A bit of background here. JMBC and I have been working on the issue
> of citations lost between RAS and CitEc. It appears that there are
> issues in the character sets of the reference string
> (refstring). CitEc produced latin-1 refstrings and stuck them into
> the AMF files. We changed the column of the reference to utf-8.
>
>> There, the character set
>> used in the citations table is still latin1. I've re-processed a
>> document with problems in characters (RePEc:mar:volksw:200425) to test
>> the changes. Before, the characters were ok in ACIS but wrong in CitEc.
>> Now we have the problem on the other side. Try for instance:
>>
>> mysql> select clid,cnid,ostring from citations where ostring like
>> "SALMON, P. (2003), As%";
>>
>> Should we change the character set for ACIS too?
>
> I think this will have to be done. I am not sure how best
> to do it and hope that Ivan can advise. We can change
> the columns to utf-8 and reload all the citations. Maybe at
> this stage we will remove the link to the citations screen
> temporarily so that we have a chance to test things.
>
> Cheers,
>
> Thomas Krichel http://openlib.org/home/krichel
> RePEc:per:1965-06-05:thomas_krichel
> skype: thomaskrichel
>
>
---
José Manuel Barrueco http://www.uv.es/=barrueco