[Hippo-cms7-user] Indexing hippo:resource item in Hippo CMS 2.16.02 takes a long time
j.joachimsthal at onehippo.com
Mon Dec 6 16:05:55 CET 2010
Considering that downloading a 4 MB mp3 file from iTunes on my fibreglass
connection at home takes 4 seconds, uploading a 14 MB PDF over the internet
from an office connection may take a few seconds more.
j.joachimsthal at onehippo.com - jasha at apache.org
Europe • Amsterdam Oosteinde 11 • 1017 WT Amsterdam • +31 (0)20 522
USA • San Francisco 755 Baywood Drive Second Floor • Petaluma CA
94954 • +1 877 414 4776 (toll free)
Canada • Montréal 5369 Boulevard St-Laurent #430 • Montréal QC H2T 1S5
• +1 (514) 316 8966
www.onehippo.com • www.onehippo.org • info at onehippo.com
On 6 December 2010 15:58, William Borg Barthet
<w.borgbarthet at onehippo.com>wrote:
> I applied the patch and tested with a 14mb pdf file. It still takes a long
> time to edit the document (no timeout but it takes half a minute or so
> running locally). I verified that the patched code is running and that the
> hippo:text property is present. Should it take that long or is it that I
> messed something up while applying the patch?
> William Borg Barthet
> On Fri, Dec 3, 2010 at 1:11 PM, Ard Schrijvers <a.schrijvers at onehippo.com>wrote:
>> On Fri, Dec 3, 2010 at 1:06 PM, Jeroen Reijn <j.reijn at onehippo.com>
>> >> You can avoid this by using something like below. This is what I used
>> >> during creation of the PdfExtractionAndIndexingTest, part of the
>> >> repository engine. This solution will work both for 7.4 and trunk.
>> >> However, two patches, one for text extractors and one for tika for the
>> >> trunk is also ok imo. Hope this helps
>> > I just wanted to put the patch in, so that others can have a look. I was
>> > planning to switch to tika (for trunk), since it will allow the code not
>> > depend on any pdfbox version, but rather on the provided tika version,
>> > already comes along with the repository. I have some other plans for the
>> > upload plugin as well with the availability of tika.
>> Tika will deligate to pdfbox. Pdfbox will also already be provided by
>> the repository.
>> > Thanks for the pointers though!
>> You're welcome
>> Regards Ard
>> Hippo-cms7-user mailing list and forums
> Europe • Amsterdam Oosteinde 11 • 1017 WT Amsterdam • +31 (0)20 522
> USA • San Francisco 755 Baywood Drive, Second Floor • Petaluma, CA.
> 94954 • +1 877 414 4776 (toll free)
> Canada • Montréal 5369 Boulevard St-Laurent #430 • Montréal QC
> H2T 1S5 • +1 (514) 316 8966
> www.onehippo.com • www.onehippo.org • info at onehippo.com
> This e-mail may be privileged and/or confidential, and the sender does
> not waive any related rights and obligations. Any distribution, use or
> copying of this e-mail or the information it contains by other than an
> intended recipient is unauthorized. If you received this e-mail in
> error, please advise me (by return e-mail or otherwise) immediately.
> Hippo-cms7-user mailing list and forums
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Hippo-cms7-user