ArchiveOrangemail archive

java-user.lucene.apache.org


(List home) (Recent threads) (34 other Apache Lucene lists)

Subscription Options

  • RSS or Atom: Read-only subscription using a browser or aggregator. This is the recommended way if you don't need to send messages to the list. You can learn more about feed syndication and clients here.
  • Conventional: All messages are delivered to your mail address, and you can reply. To subscribe, send an email to the list's subscribe address with "subscribe" in the subject line, or visit the list's homepage here.
  • Moderate traffic list: up to 30 messages per day
  • This list contains about 42,065 messages, beginning Mar 2005
  • 14 messages added yesterday

java-user.lucene.apache.org

March 2010 - page 1
Grant Ingersoll 126743366301 Mar 2010* What would it be?...
-Arne- 126746009701 Mar 2010* Hi, I'm using Lucene 3.0.0 and have large documents to search (logfiles 0,5-20MB). For better search results the query tokens are truncated left ...
Reverse Search (3 Replies)
Mark Ferguson 126747793001 Mar 2010* Hello, I am trying to figure out the best search strategy for my situation and am looking for advice. I will be processing short bits of text (Tweets ...
Ching Zheng 126755258302 Mar 2010* Hi, I have about 50 PDF douments with size of each is around 10MB. I am using PDFbox for parsing, just wondering how I can index bookmarsk with its co...
Stanislaw Osinski 126761715603 Mar 2010 Dear All, I'm happy to announce three releases from the Carrot Search team: Carrot2 v3.2.0, Lingo3G v1.3.1 and Carrot Search Labs. Carrot2 is an o...
Justin 126764580103 Mar 2010 I've looked at this for a couple days and hope someone can offer suggestions... In the past, we overrode Scorer::score(Collector), called super.sc...
Otis Gospodnetic 126764994103 Mar 2010 Hello folks, Those of you in or near New York and using Lucene or Solr should come to "Lucene: Finite-State Queries, Flexible Indexing, Scoring, ...
suman.holani126770918804 Mar 2010* Hi, I was looking into Lucene in-memory Indexes using RAMDirectory. It has also provided with something "MMapDirectory" I want the indexes t...
Murdoch, Paul 126771442304 Mar 2010* If I have indexed some content that contains some words and a single whitespace between each word as NOT_ANALYZED, is it possible to perform a phrase ...
SpanQueries in Luke (2 Replies)
Rene Hackl-Sommer 126771766704 Mar 2010* Hi, I would like to submit SpanQueries in Luke. AFAIK this isn't doable out of the box. What would be the way to go? Replace the built-in QueryPar...
PlusPlus 126772475604 Mar 2010* Hi, I want to change the Lucene's similarity in a way that I can add Fuzzy memberships to the terms of a document. Thus, TF value of a term in one...
PlusPlus 126773199204 Mar 2010* Hi, I was wondering why TF method gets a float parameter. Isn't frequency always considered to be integer? public abstract float tf(float freq) Be...
Paul Rogers 126773382004 Mar 2010* Dear All Hope someone can help. I'm trying to run the demo's that came with Lucene (3.0.0). I extracted the tar.gz to a directory /home/paul/b...
DasHeap 126774111504 Mar 2010 Another newcomer to Lucene here. I've got the Lucene web demo up and running on my test server. The indexing and search functions are working perf...
gabriele renzi 126778285205 Mar 2010* Hi everyone, In our search app we'd perform searches with a DisjunctionMaxQuery with code like searcher.search(query, hits) Now we'd like to a...
Otis Gospodnetic 126778825005 Mar 2010* Andrzej, Does that mean the regular Lucene QP will get Span query syntax support (vs. having it in that separate Surround QP)? Or maybe that already h...
anisha@ekkitab 126779563105 Mar 2010* Hi there, Could someone help me with the usage of DuplicateFilters. Here is my problem I have created a search index on book Id , title ,and author fr...
Christopher Condit 126780962405 Mar 2010* I'm trying to store semantic information in payloads at index time. I believe this part is successful - but I'm having trouble getting access ...
halbtuerderschwarze 126782223305 Mar 2010* Hi, I just changed from Lucene 2.4.1 to Lucene 3.0.0 to use the FastVectorHighlighter, because I've large documents to search and hope for better...
OutOfMemoryError (2 Replies)
Monique Monteiro 126788456806 Mar 2010* Hi all, I’m new to Lucene and I’m evaluating it in a web application which looks up strings in a huge index – the index file contains 32GB. I ke...
Nigel 126806478608 Mar 2010* Does Lucene have any special optimization for a field that has the same value for all documents in the index? For example, rather than storing a list ...
Erick Erickson 126808896908 Mar 2010* Before I reinvent the wheel..... Is there any convenient way to, say, find all the files associated with patch XXXX? I realize one can (hopefully) get...
Rohit Banga 126814029109 Mar 2010 what is the use of the function *org.apache.lucene.queryParser.QueryParser.Query() * ? Rohit Banga...
Christopher Laux 126814297909 Mar 2010* Hi all, I'm not sure if this is the right list, as it's sort of a development question too, but I don't want to bother them over there. An...
Scott Smith 126816746109 Mar 2010* I've been updating from 2.4.2 to 3.0.1. I had a number of issues (The Version object in the analyzers was an "interesting" addition-I gu...
Justin 126816881909 Mar 2010* Should these be explicitly initialized to false? private boolean fieldSortDoTrackScores; private boolean fieldSortDoMaxScore;...
Rohit Banga 126824732210 Mar 2010* Could you help me understand how to use the explain method: http://lucene.apache.org/java/3_0_1/api/core/... I was wondering if I could use it to find...
luciusvorenus 126824755610 Mar 2010* Hi How can i extend the query to add some temporal expressions ? Thank U Lucius...
bgd 126829827511 Mar 2010* Hi, I have a bunch of documents which do not have a particular field defined. How can define a query do retrieve only those documents? Thanks!...
Isabel Drost 126832155011 Mar 2010* Call for Presentations Berlin Buzzwords http://buzzwordsberlin.de Berlin Buzzwords 2010 - Search, Store, Scale 7/8 June 2010 This is to announce the B...
Thomas Koch 126832601311 Mar 2010 Hi, are there any informations that leaked about google's index layout? How do they process my query that fast over such a vast number of document...
Nathanael D. Jones 126833749011 Mar 2010* Lucene 2.4 introduced a change not documented on the File Formats page *LUCENE-510: The index now stores strings as true UTF-8 bytes (previously it wa...
surrogate pairs (7 Replies)
Yuta Kawadai 126838099612 Mar 2010* Hi Can Lucene use surrogate pairs (and its term positions or length) ? Thanks, Yuta...
Kev Kilroy 126839095912 Mar 2010* Hi, I'm using Lucene 2.4.1 with Hibernate Search 3.1.1. I have objects in the index, for each field I index as follows: @Fields( value = { @Field(...
Justin 126840031812 Mar 2010* Hi Mike and others, I have a test case for you (attached) that exhibits a file descriptor leak in ParallelReader.reopen(). I listed the OS, JDK, and s...
Vinicius Carvalho 126840494212 Mar 2010* Hello there! We are indexing metadata for our medias. One ideia is that each user adds its own metadata, so each document may have different number/na...
Peter Keegan 126840549812 Mar 2010* Is it possible to issue a single search that combines a TopFieldCollector (MultiComparatorScoringMaxScoreCollector) with a custom Collector? The custo...
An Hong 126843760512 Mar 2010* Hi, I'm trying to download some old Lucene source, e.g., http://archive.apache.org/dist/lucene/java/lu...<http://archive.apache.org/dist/lucene...
TREC-3 Runs (1 Reply)
Ivan Provalov 126848379513 Mar 2010* Just to follow up on our previous discussion, here are a few runs in which we have tested some of the Lucene different scoring mechanisms and other op...
luocan19826164126857388214 Mar 2010* Thanks very much for your patience! Nor are the files "sync"'d.?It means something like reading disk file to RAM,so reader can see it? A...
ajay_gupta 126864514915 Mar 2010* Hi, It might be general question though but I couldn't find the answer yet. I have around 90k documents sizing around 350 MB. Each document contai...
issue querying index. (8 Replies)
Paulo Avelar 126871009016 Mar 2010* Hello, I'm using the latest Lucene 3.0.1. I have written a simple test, which does the usual, creates an index, then add 2 tests documents to it. ...
Daniel Noll 126878068116 Mar 2010* Hi all. I'm trying to implement a form of document deletion where the previous versions are kept around forever ( a primitive form of versioning) ...
Marc Sturlese 126882145917 Mar 2010* I would like to know how Lucene deals with the score on multiValued fields. I am wandering if: 1) a score is computed per field and the maximum betwee...
Richard Marr 126883075217 Mar 2010 Hi all, We're meeting up at the Elgin just by Ladbroke Grove on the 6th for a bit of relaxed chat about search, and related technology. Come along...
Murdoch, Paul 126884145917 Mar 2010* Hi, I'm using Lucene 2.9.2. Currently, when creating my index, I'm calling indexWriter.addDocument(doc) for each Document I want to index. The...
Stefan Trcek 126885159317 Mar 2010* Hello Is there an api that indicates whether a field is multivalued, just like IndexReader.getFieldNames(IndexReader.FieldOption fldOption) does it fo...
exact query match? (1 Reply)
Joachim De Beule 126887374918 Mar 2010* Hi All, I have a corpus of documents which I want to search for phrases. I only want to get those documents that exactly contain a phrase. for example...
Jamie 126889192318 Mar 2010* Hi There I have an index which is 36 GB large. When I perform eight simultaneous searches (performed by JMeter) on the index, an OutOfMemory error occ...
Paul Taylor 126889485218 Mar 2010* Hi, I'm using a custom analyser based on standardanalyser with good results to search artists (i.e rolling stones/beatles) but it fails to match s...

Next page

Home | About | Privacy